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BY 

JORDAN J.N. TANG 
LIN HONG 
AND 
ARUN K. GHOSH 
FOR 

INHIBITORS OF MEMAPSIN 2 AND USE THEREOF 



INHIBITORS OF MEMAPSIN 2 AND USE THEREOF 
Background of the Invention 

This application claims priority to U.S.S.N. 60/141,363 filed June 28, 
1999 by Lin, et aL, U.S.S.N. 60/168,060 filed November 30, 1999 by Lin, et al., 
5 U.S.S.N. 60/177,836 filed January 25, 2000 by Lin, et aL, U.S.S.N, 60/178,368 
filed January 27, 2000 by Lin, et a., and U.S.S.N. 60/210,292 filed June 8, 2000 
by Lin Hong, et al., the teachings of which are incorporated by reference herein. 

This invention is in the area of the design and synthesis of specific 
inhibitors of the aspartic protease Memapsin 2 (beta-secretase) which are useful 

10 in the treatment and/or prevention of Alzheimer' s Disease. 

Alzheimer's disease (AD) is a degenerative disorder of the brain first 
described by Alios Alzheimer in 1907 after examining one of his patients who 
suffered drastic reduction in cognitive abilities and had generalized dementia 
{The early story of Alzheimer's Disease, edited by Bick et al (Raven Press, New 

1 5 York 1 987)). It is the leading cause of dementia in elderly persons. AD patients 
have increased problems with memory loss and intellectual fimctions which 
progress to the point where they cannot function as normal individuals. With 
the loss of intellectual skills the patients exhibit personality changes, socially 
inappropriate actions and schizophrenia {A Guide to the Understanding of 

20 Alzheimer's Disease and Related Disorders, edited by Jorm (New York 

University Press, New York 1987). AD is devastating for both victims and their 
families, for there is no effective palliative or preventive treatment for the 
inevitable neurodegeneration. 

AD is associated with neuritic plaques measuring up to 200 jam in 

25 diameter in the cortex, hippocampus, subiculum, hippocampal gyrus, and 
amygdala. One of the principal constituents of neuritic plaques is amyloid, 
which is stained by Congo Red (Fisher (1983); Kelly Microbiol. Sci. 1(9):214- 
219 (1984)). Amyloid plaques stained by Congo Red are extracellular, pink or 
rust-colored in bright field, and birefringent in polarized light. The plaques are 

30 composed of polypeptide fibrils and are often present around blood vessels, 
reducing blood supply to various neurons in the brain. 
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Various factors such as genetic predisposition, infectious agents, toxins, 
metals, and heeid trauma have all been suggested as possible mechanisms of AD 
neuropathy. Available evidence strongly indicates that there are distinct types of 
genetic predispositions for AD. First, molecular analysis has provided evidence 
5 for mutations in the amyloid precursor protein (APP) gene in certain AD- 

stricken families (Goate et al Nature 349:704-706 (1991); Murrell et al Science 
254:97-99 (1991); Chartier-Harlin et al Nature 353:844-846 (1991); Mullan et 
al. Nature Genet. 1 :345-347 (1992)). Additional genes for dominant forms of 
early onset AD reside on chromosome 14 and chromosome 1 (Rogaev et al, 

10 Nature ?>16\115-11% (1995); Levy-Lahad et al. Science 269:973-977 (1995); 
Sherrington et al. Nature 375:754-760 (1995)). Another loci associated with 
AD resides on chromosome 19 and encodes a variant form of apolipoprotein E 
(Corder, Science 261:921-923 (1993)). 

Amyloid plaques are abundantly present in AD patients and in Down's 

15 Syndrome individuals surviving to the age of 40. The overexpression of APP in 
Down's Syndrome is recognized as a possible cause of the development of AD 
in Down's patients over thirty years of age (Rumble et al. New England J, Med. 
320:1446-1452 (1989); Mann et al, Neurobiol Aging 10:397-399 (1989)). The 
plaques are also present in the normal aging brain, although at a lower number. 

20 These plaques lare made up primarily of the amyloid p peptide (AP; sometimes 
also referred to in the literature as p-amyloid peptide or p peptide) (Glenner and 
Wong, Biochern. Biophys, Res. Comm. 120:885-890 (1984)), which is also the 
primary protein constituent in cerebrovascular amyloid deposits. The amyloid is 
a filamentous material that is arranged in beta-pleated sheets, Ap is a 

25 hydrophobic peptide comprising up to 43 amino acids. 

The determination of its amino acid sequence led to the cloning of the 
APP cDNA (Kang et al. Nature 325:733-735 (1987); Goldgaber et al. Science 
235:877-880 (1987); Robakis et al, Proc. Natl. Acad ScL 84:4190-4194 (1987); 
Tanzi et al. Nature 331 :528-530 (1988)) and genomic APP DNA (Lemaire et 

30 al, Nucl Acids Res, 17:5 17-522 (1989); Yoshikai et al. Gene 87, 257-263 

(1990)). A nuraber of forms of APP cDNA have been identified, including the 
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three most abundant forms, APP695, APP751, and APP770. These forms arise 
from a single precursor RNA by alternate splicing. The gene spans more than 
175 kb with 18 exons (Yoshikai et ah (1990)). APP contains an extracellular 
domain, a transmembrane region and a cytoplasmic domain. Ap consists of up 
5 to 28 amino acids just outside the hydrophobic transmembrane domain and up to 
15 residues of this transmembrane domain. Ap is normally found in brain and 
other tissues such as heart, kidney and spleen. However, Ap deposits are 
usually found in abundance only in the brain. 

Van Broeckhaven et al. Science 248:1 120-1 122 (1990), have 

10 demonstrated that the APP gene is tightly linked to hereditary cerebral 

hemorrhage with amyloidosis (HCHWA-D) in two Dutch families. This was 
confirmed by the finding of a point mutation in the APP coding region in two 
Dutch patients (Levy et al. Science 248:1 124-1 128 (1990)). The mutation 
substituted a glutamine for glutamic acid at position 22 of the Ap (position 618 

15 of APP695, or position 693 of APP770). In addition, certain families are 
genetically predisposed to Alzheimer's disease, a condition referred to as 
familial Alzheimer's disease (FAD), through mutations resulting in an amino 
acid replacement at position 717 of the full length protein (Goate et al (1991); 
Murrell et al (1991); Chartier-Harlin et al (1991)). These mutations co- 

20 segregate with the disease within the families and are absent in families with 
late-onset AD. This mutation at amino acid 717 increases the production of the 
APm2 form of Ap from APP (Suzuki et al. Science 264:1336-1340 (1994)). 
Another mutant form contains a change in amino acids at positions 670 and 671 
of the full length protein (MuUan et ah (1992)). This mutation to amino acids 

25 670 and 671 increases the production of total Ap from APP (Citron et al, 
Nature 360:622-674 (1992)). 

APP is processed in vivo at three sites. The evidence suggests that 
cleavage at the B -secretase site by a membrane associated metalloprotease is a 
physiological event. This site is located in APP 12 residues away from the 

30 lumenal surface of the plasma membrane. Cleavage of the B -secretase site (28 
residues from the plasma membrane's lumenal surface) and the B -secretase site 

3 



1251728 



OMRF 182 
20487/252 



(in the transmembrane region) results in the 40/42-residue B -amyloid peptide (A 
B), whose elev£ited production and accumulation in the brain are the central 
events in the p^ithogenesis of Alzheimer's disease (for review, see Selkoe, DJ. 
Nature 399:23-31 (1999)), Presenilin 1, another membrane protein found in 
5 human brain, controls the hydrolysis at the APP ( B -secretase site and has been 
postulated to be itself the responsible protease (Wolfe, M.S.et al. Nature 
398:513-517 (1999)). Presenilin 1 is expressed as a single chain molecule and 
its processing by a protease, presenilinase, is required to prevent it from rapid 
degradation (Thinakaran, G. et al., Neuron 17:181-190 (1996) and PodHsny, 
10 M.B., et al., Neurobiol Dis. 3:325-37 (1997)). The identity of presenilinase is 
unknown. The in vivo processing of the B -secretase site is thought to be the 
rate-limiting step in A B production (Sinha, S. & Lieberburg, L, Proc. Natl 
Acad. Scl, USA, 96:11049-11053 (1999)), and is therefore a strong therapeutic 
target. 

1 5 The design of inhibitors effective in decreasing amyeloid plaque 

formation is dependent on the identification of the critical enzyme(s) in the 
cleavage of APP to yield the 42 amino acid peptide, the Api.42 form of Ap, 
Although several enzymes have been identified, it has not been possible to 
produce active enzyme. Without active enzyme, one cannot confirm the 

20 substrate specificity, determine the subsite specificity, nor determine the kinetics 
or critical active site residues, all of which are essential for the design of 
inhibitors. 

Memapsin 2 has been shown to be beta-secretase, a key protease 
involved in the production in human brain of beta-amyloid peptide from beta- 

25 amyloid precursor protein (for review, see Selkoe, D.J. Nature 399:23-3 1 
(1999)), It is now generally accepted that the accumulation of beta- amyloid 
peptide in human brain is a major cause for the Alzheimer's disease. Inhibitors 
specifically designed for human memapsin 2 should inhibit or decrease the 
formation of beta-amyloid peptide and the progression of the Alzheimer's 

30 disease. 
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Memapsin 2 belongs to the aspartic protease family. It is homologous in 
amino acid sequence to other eukaryotic aspartic proteases and contains 
motifs specific to that family. These structural similarities predict that 
memapsin 2 and other eukaryotic aspartic proteases share common catalytic 
5 mechanism Davies, D,R., Annu, Rev, Biophys, Chem. 19, 189 (1990). The 
most successM inhibitors for aspartic proteases are mimics of the transition 
state of these enzymes. These inhibitors have substrate-like structure with the 
cleaved planeir peptide bond between the carbonyl carbon and the amide 
nitrogen replaced by two tetrahedral atoms, such as hydroxyethylene [- 
10 CH(0H)-CH2-], which was originally discovered in the structure of pepstatin 
(Marciniszyn et al, 1976). 

However, for clinical use, it is preferable to have small molecule 
inhibitors which will pass through the blood brain barrier and which can be 
readily synthesized. It is also desirable that the inhibitors are relatively 
1 5 inexpensive to manufacture and that they can be administered orally. 

Screening of thousands of compounds for these properties would require an 
enormous effort. To rationally design memapsin 2 inhibitors for treating 
Alzheimer's disease, it will be important to know the three-dimensional 
structure of memapsin 2, especially the binding mode of an inhibitor in the 
20 active site of this protease. 

It is therefore an object of the present invention to provide purified, 
recombinant, and active memapsin 2, as well as its substrate and subsite 
specificity and critical active site residues. 

It is a ftirther object of the present invention to provide compositions and 
25 methods for synthesis of inhibitors of memapsin 2. 

It is a still further object of the present invention to provide compositions 
that interact with memapsin 2 or its substrate to inhibit cleavage by the 
memapsin 2 which can cross the blood brain barrier (BBB). 

It is therefore an object of the present invention to provide means for 
30 rational design and screening of compounds for inhibition of mamapsin 2. 
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Summary of the Invention 

Methods for the production of purified, catalytically active, recombinant 
memapsin 2 have been developed. The substrate and subsite specificity of the 
catalytically active enzyme have been determined. The active enzyme and 
5 assays for catalytic activity are useful in screening libraries for inhibitors of the 
enzyme. 

The substrate and subsite specificity information was used to design 
substrate analogs of the natural memapsin 2 substrate that can inhibit the 
function of memapsin 2. The substrate analogs are based on peptide sequences, 

10 shovm to be related to the natural peptide substrates for memapsin 2. The 

substrate analogs contain at least one analog of an amide (peptide) bond which 
is not capable of being cleaved by memapsin 2. Processes for the synthesis-of 
two substrate analogues including isosteres at the sites of the critical amino acid 
residues were developed and the substrate analogues, OMR99-1 and OM99-2, 

1 5 were synthesized. OM99-2 is based on an octapeptide Glu-Val-Asn-Leu- Ala- 
Ala-Glu-Phe (SEQ ID NO:28) with the Leu-Ala peptide bond substituted by a 
transition-state isostere hydroxyethylene group. The inhibition constant of 
OM99-2 is 1.6 x 10'^ M against recombinant pro-memapsin 2. Crystallography 
of memapsin 2 bound to this inhibitor was used to determine the three 

20 dimensional stmcture of the protein, as well as the importance of the various 
residues in binding. 

This information can be used by those skilled in the art to design new 
inhibitors, using commercially available software programs and techniques 
familiar to those in organic chemistry and enzymology, to design new 

25 inhibitors. For example, the side chains of the inhibitors may be modified to 
produce stronger interactions (through hydrogen bonding, hydrophobic 
interaction, ch^irge interaction and/or van der Waal interaction) in order to 
increase inhibition potency. Based on this type of information, the residues with 
minor interactions may be eliminated from the new inhibitor design to decrease 

30 the molecular weight of the inhibitor. The side chains with no structural 
hindrance from the enzyme may be cross-linked to lock in the effective 
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inhibitor confomation. This type of structure also enables the design of 
peptide surrogates which may effectively fill the binding sites of memapsin 2 
yet produce better pharmaceutical properties. 

The examples demonstrate the production of catalytically active enzyme, 
5 design and synthesis of inhibitors, and how the crystal structure was obtained. 
The examples thereby demonstrate how the methods and materials described 
herein can be used to screen libraries of compounds for other inhibitors, as well 
as for design of inhibitors. These inhibitors are useful in the prevention and/or 
treatment of Alzheimer's disease as mediated by the action of the beta secretase 
10 memapsin 2, in cleaving APP. 

Brief Description of the Drawings 
Figure 1 depicts the plasmid construct of vector pET-lla-memapsin 2- 
Tl and pET-1 la-memapsin 2-T2. The T7 promotor, amino acid sequence from 
the vector (T7 protein) (SEQ ID N0:3), and the beginning and ending of the 
15 memapsin 2 Tl and T2 construct are shown. Construct promemapsin 2-Tl was 
used in the preparation of protein for crystallization and includes residues 1 v- 
15v which are derived from vector pET-1 la. Residues lp-48p are putative pro- 
peptide. Residues 1-393 correspond to the mature protease domain and C- 
terminal extension. The residue numbering of memapsin 2 starts at the aligned 
20 N-terminal position of pepsin (Figure 3). 

Figure 2 A is a graph of the initial rate of hydrolysis of synthetic peptide 
swAPP (see Table 1) by M2pd at different pH. Figure 2B is a graph of the 
relative kcat/Km values for steady-state kinetic of hydrolysis of peptide substrates 
by M2pd. 

25 Figures 3A and 3B are the chemical structures of memapsin 2 

inhibitors, OM99-2 and OM99-1. 

Figure 4A is a graph of the inhibition of recombinant memapsin 2 by 
OM99-1 . Figure 4B is a graph of the inhibition of recombinant memapsin 2 by 
OM99-2. 

30 Figures 5A-E are photographs of crystals of recombinant memapsin 2- 

OM99-2 complex. 
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Figure 6 is a stereo view of crystal structure of memapsin 2 protease 
domain with bound OM99"2.The polypeptide backbone of memapsin 2 is 
shown as a ribbon diagram. The N-lobe and C- lobe are blue and yellow, 
respectively, e}<cept the insertion loops (designated A to G, see Figure 2) in the 
5 C-lobe are magenta and the C-terminal extension is green. The inhibitor bound 
between the lobes is shown in red. 

Figure 7 is a stereo view of comparison of the three-dimensional 
structures of memapsin 2 and pepsin. The molecular surface of the former is 
significantly larger by the insertion of surface loops and helix and the C- 
10 terminal extension. Chain tracing of human memapsin 2 is dark blue and is 
grey for human pepsin. The light blue balls represent identical residues which 
are topological] y equivalent. The disulfide bonds are shown in red for 
memapsin 2 and orange for pepsin. The C-terminal extension is in green. 

Figure 8 is a schematic presentation of interaction between OM99-2 and 
1 5 memapsin 2 protease domain. The S3' and S4' subsites are not defined. 

Figure 9 is a stereo presentation of interactions between inhibitor 
OM99-2 (orange) and memapsin 2 (light blue). Nitrogen and oxygen atoms are 
marked blue and red, respectively. Hydrogen bonds are indicated in yellow 
dotted lines. Memapsin 2 residues which comprise the binding subsites are 
20 included. Residues P4, P3, P2, Pi and Pi' (defined in Figure 8) of OM99-2 are in 
an extended coaformation. Inhibitor chain turns at residue P2' which makes a 
distinct kink at this position. The backbone of P3' and P4' directs the inhibitor to 
exit the active site. 

Figure 10 are schematics of the cross linking between P3 Val and Pi Leu 
25 side chains in the design of new inhibitors for memapsin 2 based on the current 
crystal structure. R and R' at positions P2 and Pi' indicate amino acid side 
chains. Other structural elements of inhibitor are omitted for clarity. 

Figure 1 1 are schematics of the cross linking between P4 Glu and P2 
Asn side chains in the design of new inhibitors for memapsin 2 based on the 
30 current crystal structure. R at position P3 indicates amino acid side chain. Other 
structural elements of inhibitor are omitted for clarity. 
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Figure ] 2 is a schematic of the design for the side chain at the Pi' 
subsite for the new memapsin 2 inhibitors based on the current crystal structure. 
Arrows indicate possible interactions between memapsin 2 and inhibitor. Other 
structural elements of inhibitor are omitted for clarity. 
5 Figure ] 3 is a schematic of the design of two six-membered rings in the 

inhibitor structure by the addition of atoms A and B. The ring formation 
involves the Pi -Leu side chain the the peptide backbone near Pi, P2, and P3. The 
new bonds are in dotted lines. A methyl group can be added to the beta-carbon 
of Pi -Leu. Other structural elements of inhibitor are omitted for clarity. 

1 0 Detailed Description of the Invention 

L Preparation of Catalvticallv Active Recombinant Memapsin 2 
Cloning, and Expression of Memapsin 2 
Memapsin 2 was cloned and the nucleotide (SEQ ID NO. 1) and 
predicted amino acid (SEQ ID NO. 2) sequences were determined, as described 

15 in Example 1. The cDN A was assembled from the fragments. The nucleotide 
and the deduced protein sequence are shown in SEQ ID NOs. 1 and 2, 
respectively. The protein is the same as the aspartic proteinase 2 (ASP2) 
described in EP 0 855 444 A by SmithKline Beecham Pharmaceuticals, 
(published July 29, 1998), and later reported by Sinha, et al., Nature 402, 537- 

20 540 (December 1999) and Vassar, et al, Science 286, 735-741 (22 October 
1999). 

Pro-memapsin 2 is homologous to other human aspartic proteases. 
Based on the alignments, Pro-memapsin 2 contains a pro region, an aspartic 
protease region, and a trans-membrane region near the C-terminus, The C- 
25 terminal domain is over 80 residues long. The active enzyme is memapsin 2 and 
its pro-enzyme is pro-memapsin 2. 

Refoldigg Catalytically Active Enzyme 

In order to determine the substrate specificity and to design inhibitors, it 
is necessary to express catalytically active recombinant enzyme. No other 
30 knovra proteases contain a transmembrane domain. The presence of 

transmembrane domains makes the recombinant expression of these proteins 
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less predictable and more difficult. The transmembrane region often needs to be 
removed so that secretion of the protein can take place. However, the removal 
of the transmembrane region can often alter the structure and/or fimction of the 
protein. 

5 The starting assumption was that the region of memapsin 2 that is 

homologous with other aspartic proteases would independently fold in the 
absence of the Ixansmembrane domain, and would retain protease activity in the 
absence of the C-terminal transmembrane region. The transmembrane region 
appears to serve as a membrane anchor. Since the active site is not in the 

10 transmembrane region and activity does not require membrane anchoring, 

memapsin 2 was expressed in E. coli in two different lengths, both without the 
transmembrane region, and purified, as described in Example 3. The procedures 
for the culture of transfected bacteria, induction of synthesis of recombinant 
proteins and the recovery and washing of inclusion bodies containing 

15 recombinant proteins are essentially as described by Lin et al, (1994). 

Refolding was not a simple matter, however. Two different refolding methods 
both produced satisfactory results. In both methods, the protein was dissolved in 
a strong denaturing/reducing solution such as 8 M urea/ 100 mM beta- 
mercaptoethanol. The rate at which the protein was refolded, and in what 

20 solution, was critical to activity. In one method, the protein is dissolved into 8 
M urea/ 100 mM beta-mercaptoethanol then rapidly diluted into 20 volumes of 
20 mM-Tris, pH 9.0, which is then slowly adjusted to pH 8 with 1 M HCl. The 
refolding solution was then kept at 4^ C for 24 to 48 hours before proceeding 
with purification. In the second method, an equal volume of 20 mM Tris, 0.5 

25 mM oxidized/1 .25 mM reduced glutathione, pH 9.0 is added to rapidly stirred 
pro-memapsin 2 in 8 M urea/10 mM beta-mercaptoethanol. The process is 
repeated three more times with 1 hour intervals. The resulting solution is then 
dialyzed against sufficient volume of 20 mM Tris base so that the final urea 
concentration is 0.4 M. The pH of the solution is then slowly adjusted to 8.0 

30 with 1 M HCl. 
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The refolded protein is then further purified by column chromatography, 
based on molecular weight exclusion, and/or elution using a salt gradient, and 
analyzed by SDS-PAGE analysis under reduced and non-reduced conditions. 
II. Substrate Specificity and Enzyme Kinetics of Memapsin 2 
5 Substrate Specificity 

The tissue distribution of the memapsin 2 was determined, as described 
in Example 2. The presence of memapsin 2 (M2) in the brain indicated that it 
might hydrolyze the p-amyloid precursor protein (APP). As described below, 
detailed enzymatic and cellular studies demonstrated that M2 fits all the criteria 

10 of the B-secretase. 

The M2 three-dimensional structure modeled as a type I integral 
membrane protein. The model suggested that its globular protease unit can 
hydrolyze a membrane anchored polypeptide at a distance range of 20-30 
residues from the membrane surface. As a transmembrane protein of the brain, 

15 APP is a potential substrate and its beta-secretase site, located about 28 residues 
from the plasma membrane surface, is within in the range for M2 proteolysis. 

A synthetic peptide derived from this site (SEVKM/DAEFR) (SEQ ID 
N0:4) was hydrolyzed by M2pd (modified M2 containing amino acids from Ala" 
to Ala^^^) at the beta-secretase site (marked by /). A second peptide 

20 (SEVNL/DAEFR) (SEQ ID N0:5) derived from the APP beta-secretase site and 
containing the 'Swedish mutation' (Mullan, M. et al^ Nature Genet, 2:340-342 
(1992)), known to elevate the level of alpha-beta production in cells (Citron, M. 
et al. Nature 26^:612-61 A (1992)), was hydrolyzed by M2pci with much higher 
catalytic efficiency. Both substrates were optimally cleaved at pH 4.0. A 

25 peptide derived from the processing site of presenilin 1 (SVNM/AEGD) (SEQ 
ID N0:6) was also cleaved by M2pci with less efficient kinetic parameters. A 
peptide derived from the APP gamma-secretase site (KGGVVIATVIVK) (SEQ 
ID N0:7) was aot cleaved by M2pd. Pepstatin A inhibited M2pd poorly (IC50 
approximately approximately 0.3 mM). The kinetic parameters indicate that 

30 both presenilin 1 (kcat, 0.67 s"^ Km, 15.2 mM; kcat/Km, 43.8 s'^M'^) and native 
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APP peptides (Iccat/Km, 39.9 s"^M'^) are not as good substrates as the Swedish 
APP peptide (kcat, 2.45 s'^Km, 1 mM; kcat/K^, 2450 s"^M"^). 

To deteimine if M2 possesses an APP beta-secretase function in 
mammalian cells, memapsin 2 was transiently expressed in HeLa cells (Lin, X., 
5 et al., FASEB 1 7: 1070-1080 (1993)), metabolically pulse-labeled with ^^S-Met, 
then immunoprecipitated with anti-APP antibodies for visualization of APP- 
generated fragments after SDS-polyacrylamide electrophoresis and imaging. 
SDS-PAGE patterns of immuno-precipitated APP N6-fragment (97 kD band) 
from the conditioned media (2 h) of pulse-chase experiments showed that APP 

10 was cleaved by M2. Controls transfected with APP alone and co-transfected 
with APP and M2 with Bafilomycin Al added were performed. SDS-PAGE 
patterns of APP BC-fragment (12 kD) were immunoprecipitated from the 
conditioned media of the same experiment as discussed above. Controls 
transfected witli APP alone; co-transfected with APP and M2; co-transfected 

1 5 with APP and M2 with Bafilomycin Al ; transfections of Swedish APP; and co- 
transfections of Swedish APP and M2 were performed. SDS-PAGE gels were 
also run of immuno-precipitated M2 (70 kD), M2 transfected cells; 
untransfected HeLa cells after long time film exposure; and endogenous M2 
from HEK 293 cells. SDS-PAGE patterns of APP fragments (100 kD betaN- 

20 fragment and 95 kD betaN-fragment) recovered from conditioned media after 
immuno-precipitation using antibodies specific for different APP regions 
indicated that memapsin 2 cleaved APP. 

Cells expressing both APP and M2 produced the 97 kD APP beta N- 
fragment (from the N-terminus to the beta-secretase site) in the conditioned 

25 media and the 12 kD betaC-fragment (from the beta-secretase site to the C- 

terminus) in the cell lystate. Controls transfected with APP alone produced little 
detectable beta;Nf-fragment and no beta C-fragment. Bafilomycin Al, which is 
known to raise the intra- vesicle pH of lysosomes/endosomes and has been 
shown to inhibit APP cleavage by beta-secretase (Knops, J. et al., J. Biol Chem. 

30 270:2419-2422 (1995)), aboHshed the production of both APP fragments beta 
N- and beta C- in co-transfected cells. Cells transfected with Swedish APP 
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alone did not produce the beta C-fragment band in the cell lysate but the co- 
transfection of Swedish APP and M2 did. This Swedish beta C-fragment band 
is more intense than that of wild-type APP. A 97-kD beta N-band is also seen in 
the conditioned media but is about equal intensity as the wild-type APP 
5 transfection. 

These results indicate that M2 processes the beta-secretase site of APP in 
acidic compartments such as the endosomes. To establish the expression of 
transfected M2 gene, the pulse-labeled cells were lysed and immuno- 
precipitated by anti-M2 antibodies. A 70 kD M2 band was seen in cells 

10 transfected witli M2 gene^ which has the same mobility as the major band from 
HEK 293 cells known to express beta-secretase (Citron, M. et al., Nature 
260:612-61 A (1992)). A very faint band of M2 is also seen, after a long film 
exposure, in untransfected HeLa cells, indicating a very low level of endogenous 
M2, which is insufficient to produce betaN- or betaC -fragments without M2 

15 transfection. Antibody alpha-betai.n, which specifically recognizes residues 1- 
1 7 in alpha-beta peptide, was used to confirm the correct beta-secretase site 
cleavage. In cells transfected with APP and M2, both beta N- and beta N- 
ftagments are visible using an antibody recognizing the N-terminal region of 
APP present in both fi-agments. Antibody Abetai-n recognize the beta N- 

20 fi-agment produced by endogenous beta-secretase in the untransfected cells. 

This antibody was, however, unable to recognize the betaN-fi-agment known to 
be present in cells co-transfected with APP and M2, These observations 
confirmed that betaN-fragment is the product of beta-secretase site cut by M2, 
which abolished the recognition epitope of alpha-betai-n. 

25 The processing of APP by M2 predicts the intracellular colocalization of 

the two proteins. HeLa cells co-expressing APP and M2 were stained with 
antibodies directed toward APP and M2 and visualized simultaneously by 
CSLM using a lOOx objective. Areas of colocalization appeared in yellow. 

Immunodetection observed by confocal microscopy of both APP and M2 

30 revealed their colocalization in the superimposed scans. The distribution of both 
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proteins is consistent with their residence in lysosomal/endosomal 
compartments. 

In specificity studies, it was found that M2pd cleaved its pro peptide (2 
sites) and the protease portion (2 sites) dxiring a 16 h incubation after activation 
5 (Table 1). Besides the three peptides discussed above, M2pd also cleaved 
oxidized bovine insulin B chain and a synthetic peptide Nch. Native proteins 
were not cleaved by M2pd. 

The data indicate that human M2 fulfills all the criteria of a beta- 
secretase which cleaves the beta-amyloid precursor protein (APP): (a) M2 and 

10 APP are both membrane proteins present in human brain and co-localize in 

mammalian cells, (b) M2 specifically cleaves the beta-secretase site of synthetic 
peptides and of APP in cells, (c) M2 preferentially cleaves the beta-secretase site 
from the Swedish over the wild-type APP, and (d) the acidic pH optimum for 
M2 activity and bafilomycin Al inhibition of APP processing by M2 in the cells 

15 are consistent v/ith the previous observations that beta-secretase cleavage occurs 
in acidic vesicles (Knops, J., et al., J. Biol Chem. 270:2419-2422 (1995)). The 
spontaneous appearance of activity of recombinant pro-M2 in an acidic solution 
suggests that, intracellularly, this zymogen can by itself generate activity in an 
acidic vesicle like an endosome. 

20 II. Design and Synthesis of Inhibitors 

Design of Substrate Analogs for Memapsin 2. 
The five human aspartic proteases have homologous amino acid 
sequences and have similar three-dimensional structures. There are two aspartic 
residues in the active site and each residue is foimd within the signature aspartic 

25 protease sequence motif, Asp-Thr/Ser-Gly- (SEQ ID NO: 8). There are 

generally two homologous domains within an aspartic protease and the substrate 
binding site is positioned between these two domains, based on the three- 
dimensional structures. The substrate binding sites of aspartic proteases 
generally recognize eight amino acid residues. There are generally four residues 

30 on each side of the amide bond which is cleaved by the aspartic protease. 
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Typically the side chains of each amino acid are involved in the 
specificity of the substrate/aspartic protease interaction. The side chain of each 
substrate residue is recognized by regions of the enzyme which are collectively 
called sub-sites. The generally accepted nomenclature for the protease sub-sites 
5 and their corresiponding substrate residues are shown below, where the double 
slash represents the position of bond cleavage. 
Protease sub-sites S4 S3 S2 SI SI' S2' S3' S4' 

Substrate residues P4 P3 P2 PI //PI' P2' P3' P4' 

While there is a general motif for aspartic protease substrate recognition, 

10 each protease has a very different substrate specificity and breadth of specificity. 
Once the specificity of an aspartic protease is known, inhibitors can be designed 
based on that specificity, which interact with the aspartic protease in a way that 
prevents natural substrate from being efficiently cleaved. Some aspartic 
proteases have specificities which can accommodate many different residues in 

1 5 each of the sub-sites for successful hydrolysis. Pepsin and cathepsin D have this 
type of specificity and are said to have "broad" substrate specificity. When only 
a very few residues can be recognized at a sub-site, such as in renin, the aspartic 
protease is said to have a stringent or narrow specificity. 

The information on the specificity of an aspartic protease can be used to 

20 design specific inhibitors in which the preferred residues are placed at specific 
sub-sites and thie cleaved peptide bond is replaced by an analog of the transition- 
state. These analogs are called transition state isosteres. Aspartic proteases 
cleave amide bonds by a hydrolytic mechanism. This reaction mechanism 
involves the attack by a hydroxide ion on the B-carbon of the amino acid. 

25 Protonation must occur at the other atom attached to the B-carbon through the 
bond that is to be cleaved. If the fl-carbon is insufficiently electrophilic or the 
atom attached to the bond to be cleaved is insufficiently nucleophilic the bond 
will not be cleaved by a hydrolytic mechanism. Analogs exist which do not 
mimic the transition state but which are non-hydrolyzable, but transition state 

30 isosteres mimic the transition state specifically and are non-hydrolyzable. 
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Transition state theory indicates that it is the transition state intermediate 
of the reaction which the enzyme catalyzes for which the enzyme has its highest 
affinity. It is the transition state structure^ not the ground state structure, of the 
substrate which will have the highest affinity for its given enzyme. The 
5 transition state for the hydrolysis of an amide bond is tetrahedral while the 
ground state structure is planar. A typical transition-state isostere of aspartic 
protease is -CH(0H)-CH2-, as was first discovered in pepstatin by Marciniszyn 
et al. (1976). The transition-state analogue principles have been successfiilly 
applied to inhibitor drugs for human immunodeficiency virus protease, an 

10 aspartic protease. Many of these are currently in clinical use. Information on 
the structure, specificity, and types of inhibitors can be found in Tang, Acid 
Proteases, Structure, Function and Biology, Adv. in ExptL Med. Biol. vol. 95 
(Plenum Press, NY 1977); Kostka, Aspartic Proteinases and their Inhibitors 
(Walter de Qruyter, Berlin 1985); Dunn, Structure and Fimctions of the Aspartic 

15 Proteinases, Adv. in ExptL Med. Biol. 306 (Plenum Press, NY 1991); 
Takahashi, Aspartic Proteases, Structure, Function, Biology, Biomedical 
Implications, Adv. in ExptL Med. Biol. 362 (Plenum Press, NY 1995); and 
James, Aspartic Proteinases, Retroviral and Cellular Enzymes, Adv. in ExptL 
Med. Biol. 436 (Plenum Press, NY 1998)). 

20 Substrate analog compositions are generally of the general formula X- 

L4-P4- L3-P3-L2-P2-Li-Pi-Lo-Pi'-Lr-P2'-L2'-P3'-L3'-P4'L4'-Y. The substrate 
analog compositions are analogs of small peptide molecules. Their basic 
structure is derived from peptide sequences that were determined through 
structure/function studies. It is understood that positions represented by Px 

25 represent the substrate specificity position relative to the cleavage site which is 
represented by an -Lq-. The positions of the compositions represented by Lx 
represent the linking regions between each substrate specificity position, Px. 

In a natural substrate for memapsin 2, a Px-Lx pair would represent a 
single amino acid of the peptide which is to be cleaved. In the present general 

30 formula, each Px part of the formula refers to the a-carbon and side chain 

fimctional group of each would be amino acid. Thus, the Px portion of an Px-Lx 

16 
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pair for alanine represents HC- CH3. The general formula representing the Px 
portion of the general composition is -R1CR3-. 

In general Ri can be either CH3 (side chain of alanine), CH(CH3)2 (side 
chain of valine), CH2CH(CH3)2 (side chain of leucine), (CH3)CH(CH2 CH3) 
5 (side chain of isoleucine), CH2(Indole) (side chain of tryptophan), 

CH2(Benzene) (side chain of phenylalanine), CH2CH2SCH3 (side chain of 
methionine), H (side chain of glycine), CH2OH (side chain of serine), 
CHOHCH3 (side chain of threonine), CH2(Phenol) (side chain of tyrosine), 
CH2SH (side chain of cysteine), CH2CH2CONH2 (side chain of glutamine), 

10 CH2CONH2 (side chain of asparagine), CH2CH2CH2CH2NH2 (side chain of 
lysine), CH2CH2CH2NHC(NH)(NH2) (side chain of arginine), CH2(Imidazole) 
(side chain of histidine), CH2COOH (side chain of aspartic acid), 
CH2CH2COOH (side chain of glutamic acid), and functional natural and non- 
natural derivatives or synthetic substitutions of these. 

15 It is most preferred that R3 is a single H. In general, however, R3 can be 

alkenyl, alkynal, alkenyloxy, and alkynyloxy groups that allov^ binding to 
memapsin 2. Preferably, alkenyl, alkynyl, alkenyloxy and alkynyloxy groups 
have from 2 to 40 carbons, and more preferably from 2 to 20 carbons, from 2 to 
10 carbons, or from 2 to 3 carbons., and functional natural and non-natural 

20 derivatives or synthetic substitutions of these. 

The Lx portion of the Px-Lx pair represents the atoms linking the Px 
regions together. In a natural substrate the Lx represents the B-carbon attached 
to the amino portion of what would be the next amino acid in the chain. Thus, 
Lx would be represented by ~CO-NH-. The general formula for Lx is 

25 represented by R2. In general R2 can be CO-HN (amide), CH(0H)(CH2) 

(hydroxyethylene), CH(OH)CH(OH) (dihydroxyethylene), CH(0H)CH2NH 
(hydroxyethylamine), P0(0H)CH2 (phosphinate), CH2NH, (reduced amide). It 
is understood that more than one L- maybe an isostere as long as the substrate 
analog functions to inhibit aspartic protease function. 

30 Ls which are not isosteres may either be an amide bond or mimetic of an 

amide bond that is non-hydrolyzable. 
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X and Y represent molecules which are not typically involved in the 
recognition by the aspartic protease recognition site, but which do not interfere 
with recognition. It is preferred that these molecules confer resistance to the 
degradation of the substrate analog. Preferred examples would be amino acids 
5 coupled to the substrate analog through a non-hydrolyzable bond. Other 

preferred compounds would be capping agents. Still other preferred compounds 
would be compounds which could be used in the purification of the substrate 
analogs such as biotin. 

As used herein, alkyl refers to substituted or unsubstituted straight, 

10 branched or cyclic alkyl groups; and alkoxyl refers to substituted or 

unsubstituted straight, branched or cyclic alkoxy. Preferably, alkyl and alkoxy 
groups have from 1 to 40 carbons, and more preferably from 1 to 20 carbons, 
from 1 to 10 carbons, or from 1 to 3 carbons. 

As used herein, alkenyl refers to substituted or unsubstituted straight 

1 5 chain or branched alkenyl groups; alkynyl refers to substituted or unsubstituted 
straight chain or branched alkynyl groups; alkenyloxy refers to substituted or 
unsubstituted straight chain or branched alkenyloxy; and alkynyloxy refers to 
substituted or unsubstituted straight chain or branched alkynyloxy. Preferably, 
alkenyl, alkynyl, alkenyloxy and alkynyloxy groups have from 2 to 40 carbons, 

20 and more preferably from 2 to 20 carbons, from 2 to 10 carbons, or from 2 to 3 
carbons. 

As used herein, alkaryl refers to an alkyl group that has an aryl 
substituent; aralkyl refers to an aryl group that has an alkyl substituent; 
heterocyclic-alkyl refers to a heterocyclic group with an alkyl substituent; alkyl- 
25 heterocyclic refers to an alkyl group that has a heterocyclic substituent. 

The substituents for alkyl, alkenyl, alkynyl, alkoxy, alkenyloxy, and 
alkynyloxy groups can be halogen, cyano, amino, thio, carboxy, ester, ether, 
thioether, carboxamide, hydroxy, or mercapto. Further, the groups can 
optionally have one or more methylene groups replaced with a heteroatom, such 
30 as O, NH or S. 
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A number of different substrates were tested and analyzed, and the 
cleavage rules for Memapsin 2 were determined. The results of the substrates 
which were analyzed are presented in Table 1 and the rules determined from 
these results are summarized below. 
5 (1) The primary specificity site for a memapsin 2 substrate is subsite position, 
Pi'. This means that the most important determinant for substrate specificity in 
memapsin 2 is the amino acid, ST. Pi' must contain a small side chain for 
memapsin 2 to recognize the substrate. Preferred embodiments are substrate 
analogs where Ri of the Pi ' position is either H (side chain of glycine), CH3 
10 (side chain of alanine), CH2OH (side chain of serine), or CH2OOH (side chain 
of aspartic acid). Embodiments that have an Rl structurally smaller than CH3 
(side chain of alanine) or CH2OH (side chain of serine) are also preferred. 

(2) There are no specific sequence requirements at positions P4, P3, P2, Pi, P2', 
P3', and P4' Batch site can accommodate any other amino acid residue in 

1 5 singularity as long as rule number 3 is met. 

(3) At least two of the remaining seven positions, P4, P3, P2, Pi, P2', P3', and 
P4', must have an Ri which is made up of a hydrophobic residue. It is preferred 
that there are at least three hydrophobic residues in the remaining seven 
positions, P4, P3, P2, Pi, ^2% ^3% and P4\ Preferred Ri groups for the positions 

20 that contain a hydrophobic group are CH3 (side chain of alanine), CH(CH3)2 
(side chain of valine), CH2CH(CH3)2 (side chain of leucine), (CH3)CH(CH2 
CH3) (side chain of isoleucine), CH2(IND0LE) (side chain of tryptophan), 
CH2(Benzene) (side chain of phenylalanine), CH2CH2SCH3 (side chain of 
methionine) CH2(Phenol) (side chain of tyrosine). It is more preferred that the 

25 hydrophobic group be a large hydrophobic group. Preferred Ris which contain 
large hydrophobic groups are CH(CH3)2 (side chain of valine), CH2CH(CH3)2 
(side chain of leucine), (CH3)CH(CH2 CH3) (side chain of isoleucine), 
CH2(Indole) (side chain of tryptophan), CH2(Benzene) (side chain of 
phenylalanine)., CH2CH2SCH3 (side chain of methionine) CH2(Phenol) (side 

30 chain of tyrosine). It is most preferred that positions with a hydrophobic Ri are 
CH(CH3)2 (side chain of valine), CH2CH(CH3)2 (side chain of leucine), 



1251728 



19 



OMRF 182 
20487/252 



CH2(Benzene) (side chain of phenylalanine), CH2CH2SCH3 (side chain of 
methionine), or CH2(Phenol) (side chain of tyrosine). 

(4) None of the eight positions, P4, P3, P2, Pi, Pi/ ^2% Ps', and P4' may have a 
proline side chain at its Rl position. 
5 (5) Not all subsites must have an P represented in the analog. For example, a 
substrate analog could have X-P2"Li-Pi-Lo-Pi'-Li'-P2'-L2'-P3'-L3'-Y or it could 
have X-Li-Pi-Lo-Pr-Li'-P2'-L2'-P3'-L3'-P4'L4'-Y. 

Preferred substrate analogs are analogs having the sequences disclosed in 
Table 1, with the non-hydrolyzable analog between PI and PT. 

10 Combinatorial Chemistry to Make Inhibitors 

Combinatorial chemistry includes but is not limited to all methods for 
isolating molecules that are capable of binding either a small molecule or 
another macromolecule. Proteins, oligonucleotides, and polysaccharides are 
examples of macromolecules. For example, oligonucleotide molecules with a 

1 5 given function, catalytic or ligand-binding, can be isolated from a complex 
mixture of random oligonucleotides in what has been referred to as "m vitro 
genetics" (Szostak, TIBS 19:89, 1992). One synthesizes a large pool of 
molecules bearing random and defined sequences and subjects that complex 
mixture, for example, approximately 10^^ individual sequences in 100 |ag of a 

20 100 nucleotide RNA, to some selection and enrichment process. Through 
repeated cycles of affinity chromatography and PCR amplification of the 
molecules bound to the ligand on the column, Ellington and Szostak (1990) 
estimated that 1 in 10^^ RNA molecules folded in such a way as to bind a small 
molecule dyes. DNA molecules with such ligand-binding behavior have been 

25 isolated as well (Ellington and Szostak, 1992; Bock et al, 1992). 

Techniq[ues aimed at similar goals exist for small organic molecules, 
proteins and peptides and other molecules known to those of skill in the art. 
Screening sets of molecules for a desired activity whether based on libraries of 
small synthetic molecules, oligonucleotides, proteins or peptides is broadly 

30 referred to as combinatorial chemistry. 
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There are a number of methods for isolating proteins either have de novo 
activity or a modifed activity. For example, phage display libraries have been 
used for a number of years. A preferred method for isolating proteins that have 
a given function is described by Roberts and Szostak (Roberts R.W. and Szostak 
5 J.W. Proc. Natl. Acad. Sci. USA, 94(23)12997-302 (1 997). Another preferred 
method for combinatorial methods designed to isolate peptides is described in 
Cohen et al. (Cohen B.A., et al, Proc. Natl. Acad. Sci. USA 95(24): 14272-7 
(1998)). This method utilizes a modified two-hybrid technology. Yeast 
two-hybrid systems are useful for the detection and analysis of protein:protein 

10 interactions. The two-hybrid system, initially described in the yeast 

Saccharomyces cerevisiae^ is a powerful molecular genetic technique for 
identifying nev/ regulatory molecules, specific to the protein of interest (Fields 
and Song, Nature 340:245-6 (1989)). Cohen et al., modifed this technology so 
that novel interactions between synthetic or engineered peptide sequences could 

15 be identified which bind a molecule of choice. The benefit of this type of 
technology is that the selection is done in an intracellular environment. The 
method utilizes a library of peptide molecules that attach to an acidic activation 
domain. A peptide of choice, for example an extracellular portion of memapsin 
2 is attached to a DN A binding domain of a transcriptional activation protein, 

20 such as Gal 4. By performing the Two-hybrid technique on this type of system, 
molecules that bind the extracellular portion of memapsin 2 can be identified. 
Screening of Small Molecule Libraries 
In addition to these more specialized techniques, methodology well 
known to those of skill in the art, in combination with various small molecule or 

25 combinatorial libraries, can be used to isolate and characterize those molecules 
which bind to or interact with the desired target, either memapsin 2 or its 
substrate. The relative binding affinity of these compoxmds can be compared 
and optimum inhibitors identified using competitive or non-competitive binding 
studies which sire well known to those of skill in the art. Preferred competitive 

30 inhibitors are non-hydro lyzable analogs of memapsin 2. Another will cause 
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allosteric rearriingements which prevent memapsin 2 from functioning or 
folding correctly. 

Computer assisted Rational Drug Design 

Another way to isolate inhibitors is through rational design. This is 
5 achieved through structural information and computer modeling. Computer 
modeling technology allows visualization of the three-dimensional atomic 
structure of a selected molecule and the rational design of new compounds that 
will interact with the molecule. The three-dimensional construct typically 
depends on data from x-ray crystallographic analyses or NMR imaging of the 

10 selected molecule. The molecular dynamics require force field data. The 

computer graphics systems enable prediction of how a new compound will link 
to the target molecule and allow experimental manipulation of the structures of 
the compound and target molecule to perfect binding specificity. For example, 
using NMR spectroscopy, Inouye and coworkers were able to obtain the 

1 5 structural information of N-terminal truncated TSHK (transmembrane sensor 
histidine kinases) fragments which retain the structure of the individual sub- 
domains of the catalytic site of a TSHK. On the basis of the NMR study, they 
were able to identify potential TSHK inhibitors (U.S. Patent No. 6,077,682 to 
Inouye). Another good example is based on the three-dimensional structure of a 

20 calcineurin/FKBP 1 2/FK506 complex determined using high resolution X-ray 
crystallography to obtain the shape and structure of both the calcineurin active 
site binding pocket and the auxiliary FKBP12/FK506 binding pocket (U.S. 
Patent No. 5,978,740 to Armistead). With this information in hand, researchers 
can have a good understanding of the association of natural ligands or substrates 

25 with the binding pockets of their corresponding receptors or enzymes and are 
thus able to design and make effective inhibitors. 

Prediction of molecule-compound interaction when small changes are 
made in one or both requires molecular mechanics software and computationally 
intensive computers, usually coupled with user-friendly, menu-driven interfaces 

30 between the molecular design program and the user. Examples of molecular 
modeling systems are the CHARMm and QUANTA programs, Polygen 
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Corporation, Waltham, MA. CHARMm performs the energy minimization and 
molecular dynamics functions. QUANTA performs the construction, graphic 
modeling and analysis of molecular stracture, QUANTA allows interactive 
construction, miodification, visualization, and analysis of the behavior of 
5 molecules with each other. 

A number of articles review computer modeling of drugs interactive with 
specific proteins, such as Rotivinen, et al., 1988 Acta Pharmaceutica Fennica 
97, 159-166; Ripka, New Scientist 54-57 (June 16, 1988); McKinaly and 
Rossmaim, \9%9 Annu. Rev, Pharmacol. ToxicioL 29, 111-122; Perry and 

1 0 Davies, QSAR: Quantitative Structure-Activity Relationships in Drug Design 
pp. 189-193 (Alan R. Liss, Inc. 1989); Lewis and Dean, 1989 Proa R, Soc. 
Lond 236, 125-140 and 141-162; and, with respect to a model enzyme for 
nucleic acid components, Askew, et al., 1989 J. Am. Chem. Soc. Ill, 1082- 
1090. Other computer programs that screen and graphically depict chemicals 

15 are available from companies such as BioDesign, Inc., Pasadena, CA., AUelix, 
Inc, Mississauga, Ontario, Canada, and Hypercube, Inc., Cambridge, Ontario, 
Although described above with reference to design and generation of 
compounds which could alter binding, one could also screen libraries of known 
compounds, including natural products or synthetic chemicals, and biologically 

20 active materials, including proteins, for compounds which alter substrate 
binding or enzj/matic activity. 
Screening of Libraries 

Design of substrate analogs and rational drug design are based on 
knowledge of the active site and target, and utilize computer software programs 

25 that create detailed structures of the enzyme and its substrate, as well as ways 
they interact, alone or in the presence of inhibitor. These techniques are 
significantly enhanced with x-ray crystallographic data in hand. Inhibitors can 
also be obtained by screening libraries of existing compoimds for those which 
inhibit the catalytically active enzyme. In contrast to reports in the literature 

30 relating to memapsin 2, the enzyme described herein has activity analogous to 
the naturally produced enzyme, providing a means for identifying compounds 
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which inhibit the endogenous activity. These potential inhibitors are typically 
identified using high throughput assays, in which enzyme, substrate (preferably 
a chromogenic substrate) and potential inhibitor (usually screened across a range 
of concentrations) are mixed and the extent of cleavage of substrate determined. 
5 Potentially usefol inhibitors are those which decrease the amount of cleavage. 
IL Methods of diagnosis and treatment 

Inhibitors can be used in the diagnosis and treatment and/or prevention 
of Alzheimer's disease and conditions associated therewith, such as elevated 
levels of the forty-two amino acid peptide cleavage product, and the 
1 0 accumulation of the peptide in amyeloid plaques. 

Diagnostic Uses 

The substrate analogs can be used as reagents for specifically binding to 
memapsin 2 or memapsin 2 analogs and for aiding in memapsin 2 isolation and 
purification or characterization, as described in the examples. The inhibitors 
1 5 and purified recombinant enzyme can be used in screens for those individuals 
more genetically prone to develop Alzheimer's disease. 
Therapeutic Uses 

Recombinant human memapsin 2 cleaves a substrate with the sequence 
LVNM/AEGD (SEQ ID N0:9). This sequence is the in vivo processing site 

20 sequence of human presenilins. Both presenilin 1 and presenilin 2 are integral 
membrane proteins. They are processed by protease cleavage, which removes 
the N terminal sequence from the unprocessed form. Once processed, presenilin 
forms a two-chain heterodimer (Capell et al., J. Biol. Chem. 273, 3205 (1998); 
Thinakaran et al, NeurobioL Dis. 4, 438 (1998); Yu et al., Neurosci Lett. 

25 2;254(3): 125-8 (1 998)), which is stable relative to the improcessed presenilins. 
Unprocessed presenilines are quickly degraded (Thinakaran et al., J. Biol 
Chem, 272, 28415 (1997); Steiner et al, J. Biol. Chem. 273, 32322 (1998)). It 
is known that presenilin controls the in vivo activity of beta-secretase, which in 
turn cleaves the amyloid precursor protein (APP) leading to the formation of 

30 alpha-beta42. The accumulation of alpha-beta42 in the brain cells is known to 
be a major cause of Alzheimer's disease (for review, see Selkoe, 1998). The 
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activity of presenilin therefore enhances the progression of Alzheimer's disease. 
This is supported by the observation that in the absence of presenilin gene, the 
production of alpha-beta42 peptide is lowered (De Strooper et al., Nature 391, 
387 (1998)). Since unprocessed presenilin is degraded quickly, the processed, 

5 heterodimeric presenilin must be responsible for the accumulation of alpha- 
beta42 leading to Alzheimer's disease. The processing of presenilin by 
memapsin 2 would enhance the production of alpha-beta42 and therefore, 
further the progress of Alzheimer's disease. Therefore a memapsin 2 inhibitor 
that crosses the blood brain barrier can be used to decrease the likelihood of 

1 0 developing or slow the progression of Alzheimer' s disease which is mediated by 
deposition of alpha-beta42. Since memapsin 2 cleaves APP at the beta cleavage 
site, prevention of APP cleavage at the beta cleavage site will prevent the build 
up of alpha-beta42. 
Vaccines 

1 5 The catalytically active memapsin 2 or fragments thereof including the 

active site defined by the presence of two catalytic aspartic residues and 
substrate binding cleft can be used to induce an immune response to the 
memapsin 2. The memapsin 2 is administered in an amount effective to elicit 
blocking antibodies, i.e., antibodies which prevent cleavage of the naturally 

20 occurring substrate of memapsin 2 in the brain. An unmodified vaccine may be 
useful in the prevention and treatment of Alzheimer's disease. The response to 
the vaccine may be influenced by its composition, such as inclusion of an 
adjuvant, viral proteins from production of the recombinant enzyme, and/or 
mode of administration (amount, site of administration, frequency of 

25 administration, etc). Since it is clear that the enzyme must be properly folded in 
order to be active, antibody should be elicited that is active against the 
endogenous memapsin 2. Antibodies that are effective against the endogenous 
enzyme are less likely to be produced against the enzyme that is not properly 
refolded. 
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Pharmaceutically Acceptable Carriers 

The inhibitors will typically be administered orally or by injection. Oral 
administration is preferred. Alternatively, other formulations can be used for 
delivery by pulmonary, mucosal or transdermal routes. The inhibitor vv^ill 
5 usually be administered in combination with a pharmaceutically acceptable 
carrier. Pharmaceutical carriers are known to those skilled in the art. The 
appropriate canler will typically be selected based on the mode of 
administration. Pharmaceutical compositions may also include one or more active 
ingredients such as antimicrobial agents, antiinflammatory agents, and analgesics. 

iO Preparations for parenteral administration or administration by injection 

include sterile aiqueous or non-aqueous solutions, suspensions, and emulsions. 
Examples of non-aqueous solvents are propylene glycol, polyethylene glycol, 
vegetable oils such as olive oil, and injectable organic esters such as ethyl 
oleate. Aqueous carriers include water, alcoholic/aqueous solutions, emulsions 

1 5 or suspensions, including saline and buffered media. Preferred parenteral 
vehicles include sodium chloride solution. Ringer's dextrose, dextrose and 
sodium chloride, lactated Ringer's, or fixed oils. Intravenous vehicles include 
fluid and nutrient replenishers, and electrolyte replenishers (such as those based 
on Ringer's dextrose). 

20 Formulations for topical (including application to a mucosal surface, 

including the mouth, pulmonary, nasal, vaginal or rectal) administration may 
include ointments, lotions, creams, gels, drops, suppositories, sprays, liquids and 
powders. Formulations for these applications are known. For example, a number 
of pulmonary formulations have been developed, typically using spray drying to 

25 formulate a povi^der having particles with an aerodynanmic diameter of between 
one and three nriicrons, consisting of drug or drug in combination with polymer 
and/or surfactant. 

Compositions for oral administration include powders or granules, 
suspensions or solutions in water or non-aqueous media, capsules, sachets, or 

30 tablets. Thickeners, flavorings, diluents, emulsifiers, dispersing aids or binders 
may be desirable. 
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Peptides as described herein can also be administered as a 
pharmaceutical] y acceptable acid- or base- addition salt, formed by reaction with 
inorganic acids such as hydrochloric acid, hydrobromic acid, perchloric acid, 
nitric acid, thiocyanic acid, sulfuric acid, and phosphoric acid, and organic acids 
5 such as formic acid, acetic acid, propionic acid, glycolic acid, lactic acid, 

pyruvic acid, oxalic acid, malonic acid, succinic acid, maleic acid, and fumaric 
acid, or by reaction with an inorganic base such as sodium hydroxide, 
ammonium hydroxide, potassium hydroxide, and organic bases such as mono-, 
di-, trialkyl and aryl amines and substituted ethanolamines. 

10 Dosages 

Dosing is dependent on severity and responsiveness of the condition to be 
treated, but will normally be one or more doses per day, with course of treatment 
lasting from several days to several months or until the attending physician 
determines no fiarther benefit will be obtained. Persons of ordinary skill can 

1 5 determine optimum dosages, dosing methodologies and repetition rates. 

The dosage ranges are those large enough to produce the desired effect in 
which the symptoms of the memapsin 2 mediated disorder are alleviated 
(typically characterized by a decrease in size and/or number of amyloid plaque, 
or by a failure to increase in size or quantity), or in which cleavage of the alpha- 

20 beta42 peptide is decreased. The dosage can be adjusted by the individual 
physician in the event of any counterindications. 

The present invention will be further understood by reference to the 
following non-limiting examples. 
Example 1. Cloning of memapsin 2. 

25 1. Cloning and nucleotide sequence of pro-memapsin 2, 

New sequences homologous to human aspartic proteases were found in 
the following entries in the EST IMAGE database: AAl 36368 pregnant uterus 
ATCC 947471, AA207232 neurepithelium ATCC 214526, and R55398 human 
breast ATCC 392689. The corresponding bacterial strains: #947471, #214526, 

30 and # 392689 containing the EST sequences were obtained from the ATCC 
(Rockville, MD). The sequencing of these clones obtained from ATCC 
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confirmed that they contained sequences not identical to known human aspartic 
proteases. The completed sequences of these clones assembled into about 80% 
of prepro-M2 cDNA. Full length cDNAs of these clones were obtained using 
the following methods. 
5 The Human Pancreas Marathon-Ready cDNA (Clontech), which is 

double-strand cDNA obtained by reverse-transcription, primer addition, and 
second strand s^^oithesize of mRNA from human tissues, was used as template 
for PGR amplification. An adapter primer (API) and a nested adapter primer 
(AP2) were used for 5'- and 3'-RACE PGR. For PGR the 5'-region of the 
10 memapsin 2 cDNA, primers API and NHASPRl were used. Primers for the 3'- 
end of the cDNA are NHASPF2 and API. The middle of the cDNA was 
ampUfied by primers NHASPFl and NHASPR2. The sequence for the primers 
is as follows: NHASPFl : GGTAAGGATGGGCCATGGGGCCAACGTC (SEQ 
ID NO: 10), 

1 5 NHASPRl : GACGTTGGGGCCATGGGGGATGCTTACC (SEQ ID NO: 1 1 ), 
NHASPF2: ACGTTGTCTTTGATCGGGCCCGAAAACGAATTGG (SEQ ID 
NO: 12), 

NHASPR2: CCAATTCGTTTTCGGGCCCGATCAAAGACAACG (SEQ ID 
NO: 13), 

20 API : CCATCCTAATACGACTCACTATAGGGC (SEQ ID NO: 14), and 
AP2: ACTCACTATAGGGCTCGAGCGGC (SEQ ID NO: 15) 

Memapsin 2 was also cloned from a human pancreas library (Quick- 
Screen Human cDNA Library Panel) contained in lambda-gtlO and lambda-gtl 1 
vectors. The primers from the vectors, GTIOFWD, GTIOREV, GTl IFWD, and 

25 GTl IREV, were used as outside primers. The sequence of the primers used was: 
GTIOFWD: CTTTTGAGCAAGTTCAGCCTGGTTAA (SEQ ID NO: 16), 
GTIOREV: GAGGTGGCTTATGAGTATTTCTTCCAGGGTA (SEQ ID 
NO: 17), 

GTl IFWD: TGGCGACGACTCCTGGAGCCCG (SEQ ID N0:18), 
30 GTl IREV: TGACACCAGACCAACTGGTAATGG (SEQ ID NO: 19). 
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In addition, memapsin 2 cDNA was amplified directly from the human 
pancreatic lambda-gtlO and lambda-gtl 1 libraries. The sequence of the primers 
was: PASPNl : catatgGCGGGAGTGCTGCCTGCCCAC (SEQ ID NO:20) 
and 

5 NHASPCl : ggatccTCACTTCAGCAGGGAGATGTCATCAGCAAAGT (SEQ 
ID N0:21). 

The amplified memapsin 2 fragments were cloned into an intermediate 
PGR vector (Invitrogen) and sequenced. 

The assembled cDNA from the fragments, the nucleotide and the 
10 deduced protein sequence are shown in SEQ ID NO 1 and SEQ ID NO 2. 

Pro-memapsin 2 is homologous to other human aspartic proteases. 
Based on the alignments, Pro-memapsin 2 contains a pro region, an aspartic 
protease region, and a trans-membrane region near the C-terminus. The active 
enzyme is memapsin 2 and its pro-enzyme is pro-memapsin 2. 
1 5 Example 2. Distribution of memapsin 2 in human tissues. 

Multiple tissue cDNA panels from Clontech were used as templates for 
PGR amplification of a 0.82 kb fragment of memapsin 2 cDNA. The primers 
used for memapsin 2 were NHASPFl and NHASPR2. Tissues that contain 
memapsin 2 or fragments of memapsin 2 yielded amplified PGR products. The 
20 amount of amplified product indicated that memapsin 2 is present in the 

following organs from most abundant to least abundant: pancreas^ brain, lung, 
kidney, liver, placenta, and heart. Memapsin 2 is also present in spleen, 
prostate, testis, ovary, small intestine, and colon cells. 

Example 3. Expression of pro-memapsin 2 cDNA in E, coli, refolding and 
25 puriiBcation of pro-memapsin 2. 

The pro-memapsin 2 was PGR amplified and cloned into the BamRl site 
of a pETl la vector. The resulting vector expresses pro-memapsin 2 having a 
sequence from .Ala-8p to Ala 326. Figure 1 shows the construction of two 
expression vectors, pETl 1 -memapsin 2-Tl (hereafter Tl) and pETl 1 -memapsin 
30 2-T2 (hereafter T2). In both vectors, the N-terminal 1 5 residues of the 

expressed recombinant proteins are derived from the expression vector, Pro- 
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memapsin 2 residues start at residue Ala- 16. The two recombinant pro- 
memapsin 2s have different C-terminal lengths. Clone Tl ends at Thr- 454 and 
clone T2 ends at Ala-419. The Tl construct contains a C-terminal extension 
from the T2 construct but does not express any of the predicted transmembrane 
5 domain. 

Expression of recombinant proteins and recovery of inclusion bodies 
The Tl ^and T2 expression vectors were separately transfected into E. 
coli strain BL2jI(DE3). The procedures for the culture of transfected bacteria, 
induction for synthesis of recombinant proteins and the recovery and washing of 
10 inclusion bodies containing recombinant proteins are essentially as previously 
described (Lin et aL, 1994). 

Three different refolding methods have produced satisfactory results. 

(i) The rapid dilution method. 

Pro-memapsin 2 in 8 M urea/ 100 mM beta-mercaptoethanol with 
15 OD280nm = 5 was rapidly diluted into 20 volumes of 20 mM-Tris, pH 9.0. The 
solution was slowly adjusted into pH 8 with 1 M HCl. The refolding solution 
was then kept at 4° C for 24 to 48 hours before proceeding with purification. 

(ii) The reverse dialysis method 

An equal volume of 20 mM Tris, 0.5 mM oxidized/ 1.25 mM reduced 
20 glutathione, pH 9.0 is added to rapidly stirred pro-memapsin 2 in 8 M urea/10 
mM beta-mercaptoethanol with OD280 nm = 5. The process is repeated three 
more times with 1 hour intervals. The resulting solution is then dialyzed against 
sufficient volume of 20 mM Tris base so that the final urea concentration is 0,4 
M. The pH of the solution is then slowly adjusted to 8.0 with 1 M HCL 
25 iil The preferred method for refolding. 

Inclusion bodies are dissolved in 8 M urea, 0.1 M Tris, 1 mM Glycine, 1 
mM EDTA, 100 mM beta-mercaptoethanol, pH 10.0. The OD280 of the 
inclusion bodies are adjusted to 5.0 with the 8 M urea solution without beta- 
mercaptoethanol. The final solution contains the following reducing reagents: 
30 10 mM beta-mercaptoethanol, 10 mM DTT (Dithiothreitol), 1 mM reduced 

glutathion, and 0.1 M oxidized glutathion. The final pH of the solution is 10,0. 
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The above solution is rapidly diluted into 20 volumes of 20 mM Tris 
base, the pH is adjusted to 9.0, and the resulting solution is kept at 4 for 16 
hr. The solution is equilibrated to room temperature in 6 hr, and the pH is 
adjusted to 8.5. The solution is returned to 4 again for 18 hr, 

5 The solution is again equilibrated to room temperature in 6 hr, and the 

pH is adjusted to 8.0. The solution is returned to 4 again for 4 to 7 days. 

The refolding procedures are critical to obtain an enzymically active 
preparation which can be used for studies of subsite specificity of M2, to 
analyze inhibition potency of M2 inhibitors, to screen for inhibitors using either 

10 random structural libraries or existing collections of compound libraries, to 
produce crystals for crystallography studies of M2 structures, and to produce 
monoclonal or polyclonal antibodies of M2. 

Purification of recombinant pro-memapsin 2-12 

The refolded material is concentrated by ultrafiltration, and separated on 

1 5 a SEPHACRYL™ S-300 column equilibrated with 20 mM Tris.HCl, 0.4 M 
urea, pH 8.0. The refolded peak (second peak) from the S-300 colunm can be 
further purified with a FPLC RESOURCE-Q™ column, which is equilibrated 
with 20 mM Tris-HCl, 0.4 M urea, pH 8.0. The enzyme is eluted from the 
column with a linear gradient of NaCl. The refolded peak from S-300 can also 

20 be activated before ftirther purification. For activation, the fractions are mixed 
with equal volume 0.2 M Sodium Acetate, 70% glycerol, pH 4.0. The mixture is 
incubated at 22 for 1 8 hr, and then dialyzed twice against 20 volumes of 20 
mM Bis-Tris, 0.4 M urea, pH 6.0. The dialyzed materials are then further 
purified on a FPLC RESOURCE-Q™ column equilibrated with 20 Bis-Tris, 0.4 

25 M urea, pH 6.0. The enzyme is eluted with a linear gradient of NaCl. 

SDS-PAGE analysis of the S-300 fractions under reduced and non- 
reduced conditions indicated that Pro-memapsin 2 first elutes as a very high 
molecular weight band (greater than about 42 kD) under non-reduced 
conditions. This indicates that the protein is not folded properly in these 

30 fractions, due to disulfide cross linking of proteins. Subsequent fractions 

contain a protein of predicted pro-memapsin 2-T2 size (about 42 kDa). The pro- 
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enzyme obtained in these fractions is also proteolytically active for auto- 
catalyzed activation. These fractions were pooled and subjected to 
chromatography on the FPLC RESOURCE™ column eluted with a linear 
gradient of NaCL Some fractions were analyzed using SDS-PAGE under non- 

5 reducing conditions. The analysis showed that fractions 6 and 7 contained most 
of the active proteins, which was consistent with the first FPLC peak containing 
the active protein. The main peak was coupled to a shoulder peak, and was 
present with repeated purification with the same RESOURCE™ Q column. The 
main shoulder peaks were identified as active pro-memapsin 2 that exist in 

1 0 different conformations under these conditions. 

Example 4. Proteolytic activity and cleavage-site preferences of 
recombinant memapsin 2. 

The amino acid sequence around the proteolytic cleavage sites was 
determined in order to establish the specificity of memapsin 2. Recombinant 

15 pro-memapsin 2-Tl was incubated in 0.1 M sodium acetate, pH 4.0, for 16 
hours at room temperature in order to create autocatalyzed cleavages. The 
products were £inalyzed using SDS-polyacrylamide gel electrophoresis. Several 
bands which corresponded to molecular weights smaller than that of pro- 
memapsin 2 were observed. The electrophoretic bands were trans-blotted onto a 

20 PVDF membrane. Four bands were chosen and subjected to N-terminal 

sequence detennination in a Protein Sequencer. The N-terminal sequence of 
these bands established the positions of proteolytic cleavage sites on pro- 
memapsin 2. 

In addition, the oxidized fi-chain of bovine insulin and two different 
25 synthetic peptides were used as substrates for memapsin 2 to determine the 
extent of other hydrolysis sites. These reactions were carried out by auto- 
activated pro-memapsin 2 in 0.1 M sodium acetate, pH 4.0, which was then 
incubated v^th the peptides. The hydrolytic products were subjected to HPLC 
on a reversed phase C-18 column and the eluent peaks were subjected to 
30 electrospray mass spectrometry for the determination of the molecular weight of 
the fragments. Two hydrolytic sites were identified on oxidized insulin B -chain 
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(Table 1), Three hydrolytic sites were identified from peptide NCH-gamm A 
single cleavage site was observed in synthetic peptide PS 1 -gamma, whose 
sequence (LVNMAEGD) (SEQ ID N0:9) is derived from the beta-processing 
site of human presenilin 1 (Table 1). 
Table 1 : Substrate Specificity of Memapsin 2 



Site# 


Substrate 


P4 


P3 


P2 


PI 


pr 


P2' 


P3^ 


P4' 




1 


Pro- 

memapsin 2 


R 


G 


S 


M 


A 


G 


V 


L 


aa 12-18 of SEQ ID No.3 


2 


G 


T 


Q 


H 


G 


I 


R 


L 


aa 23-30 of SEQ ID No.3 


3 


S 


S 


N 


F 


A 


V 


G 


A 


aa 98-105 of SEQ ID No.3 


4 


G 


L 


A 


Y 


A 


E 


I 


A 


aa 183-190 of SEQ ID No.3 


5 


Oxidized 
insulin B- 
chain ' 


H 


L 




G 


S 


H 


L 


V 


C"^ is cysteic acid; 
SEQ ID No, 22 
SEQ ID No. 23 


6 




G 


E 


R 


G 


F 


F 


Y 


7 


Synthetic 
peptide'*' 








V 


G 


S 


G 


V 


Three sites cleaved in a peptide: 
VGSGVLLSRK (SEQ ID 
NO:30) 

SEQ ID No. 24 

SEQ ID No. 25 

SEQ ID No, 26 


8 




V 


G 


s 


G 


V 


L 


L 


9 


G 


V 


L 


L 


S 


R 


K 




10 


Peptide"^ * 


L 


V 


N 


M 


A 


E 


G 


D 


SEQ ID No. 9 



Example 5. Activation of pro-memapsin 2 and enzyme kinetics. 

Incubation in 0.1 M sodium acetate, pH 4.0, for 16 h at 22^C auto- 
catalytically converted pro-M2p^ to M2pd. For initial hydrolysis tests, two 
synthetic peptides were separately incubated with pro-Ml^d in 0.1 M Na acetate, 
pH 4.0 for different periods ranging from 2 to 18 h. The incubated samples 
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were subjected to LC/MS for the identification of the hydrolytic products. For 
kinetic studies, the identified HPLC (Beckman System Gold) product peaks 
were integrated for quantitation. The Km and kcat values for presenilin 1 and 
Swedish APP peptides (Table 1) were measured by steady- state kinetics. The 
5 individual Km and kcat values for APP peptide could not be measured accurately 
by standard methods, so its kcat/Km value was measured by competitive 
hydrolysis of mixed substrates against presenilin 1 peptide (Fersht, A. "Enzyme 
Structure and VIechanism", 2"^ Ed., W.H. Freeman and Company, New York. 
(1985)). 

10 The results are shown in Figures 2 A and 2B. The conversion of pro- 

M2pd at pH 4.0 to smaller fragments was shown by SDS -poly aery lamide 
electrophoresis. The difference in migration between pro-Ml^^ and converted 
enzyme is evident in a mixture of the two. Figure 2 A is a graph of the initial 
rate of hydrolysis of synthetic peptide swAPP (see Table 1) by M2pd at different 

15 pH, Figure 2B is a graph of the relative kcat/Km values for steady-state kinetic of 
hydrolysis of peptide substrates by M2pd. 
Example 6, Expression in Mammalian cells. 
Methods 

PM2 cE>NA was cloned into the EcoRV site of vector pSecTag A 
20 (Invitrogen). Human APP cDNA was PGR amplified from human placenta 8- 
gtl 1 library (Clontech) and cloned into the Nhel and^al sites of pSecTag A. 
The procedure for transfection into HeLa cells and vaccinia virus infection for 
T7-based expression are essentially the same as described by Lin, X., FASEB J. 
7:1070-1080(1993). 
25 Transfected cells were metabolically labeled with 200 microCi S 

methionine and cysteine (TransLabel; ICN) in 0.5 mi of serum-free/methionine- 
free media for 30 min, rinsed with 1 ml media, and replaced with 2 ml 
DMEM/10% FCS, In order to block vesicle acidification, Bafilomycin Al was 
included in the media (Perez, R.G., et al, J Biol Chem 271:9100-9107 (1996)). 
30 At different time points (chase), media was removed and the cells were 

harvested and lysed in 50 mM Tris, 0.3 M NaCl, 5 mM EDTA, 1% Triton X- 
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100, pH 7.4, containing 10 mM iodoacetamide, 10 :M TPCK, 10 :M TLCK, and 
2 microg/ml leupeptin. The supernatant (14,000 x g) of cell lysates and media 
were immunoadsorbed onto antibody bound to protein G sepharose (Sigma). 
Anti-APP N-terminal domain antibody (Chemicon) was used to recover the 
betaN-fragment of APP and anti-alpha-beta m? antibody (Chemicon, 
recognizing the N-terminal 17 residues of alpha-beta) was used to recover the 12 
kDa B C-fragment. The former antibody recognized only denatured protein, so 
media was first incubated in 2 mM dithiothrietol 0.1% SDS at 55°C for 30 min 
before immunoabsorption. Samples were cooled and diluted with an equal 
volume of cell lysis buffer before addition of anti-APP N-terminal domain 
(Chemicon). Beads were washed, eluted with loading buffer, subjected to SDS- 
PAGE (NOVEX^^) and visualized by autoradiogram or phosphorimaging 
(Molecular Dynamics) on gels enhanced with Amplify (Amersham). 
Immunodetection of the betaN-fragment was accomplished by transblotting onto 
a PVDF membrane and detecting with anti-alpha-betai-n and chemilviminescent 
substrate (Amersham). 
Results ., 

HeLa cells transfected with APP or M2 in 4-well chamber slides were 
fixed with acetone for 10 min and permeabilized in 0.2% Triton X-100 in PBS 
for 6 min. For locaUzing M2, polyclonal goat anti-;?TO-M2pci antibodies were 
purified on DE AE-sepharose 6B and affinity purified against recombinant pro- 
M2pd immobilized on Affigel (BioRad). Purified anti-j9ro-M2pd antibodies were 
conjugated to Alexa568 (Molecular Probes) according to the manufacturer's 
protocol. Fixed cells were incubated overnight with a 1 : 100 dilution of antibody 
in PBS containing 0.1% BSA and washed 4 times with PBS. For APP, two 
antibodies were used. Antibody A IJ (described above) and antibody Afi 17-42, 
which recognizes the first 26 residues following the beta-secretase cleavage site 
(Chemicon). After 4 PBS washes, the cells were incubated overnight with an 
anti-mouse FITC conjugate at a dilution of 1 :200. Cells were mounted in 
Prolong anti-ftide reagent (Molecular Probes) and visualized on a Leica TCS 
confocal laser scanning microscope. 
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Example 7: Design and Synthesis of OM99-1 and OM99-2, 

Based on the results of specificity studies of memapsin 2, it was 
predicted that good residues for positions PI and PT would be Leu and Ala. It 
was subsequently determined from the specificity data that PI' preferred small 

5 residues, such as Ala and Ser. However, the crystal structure (determined below 
in Example 9) indicates that this site can accommodate a lot of larger residues. 
It was demonstrated that PI' of memapsin 2 is the poshion with the most 
stringent specificity requirement where residues of small side chains, such as 
Ala, Ser, and Asp, are preferred. Ala was selected for PT mainly because its 

10 hydophobicity over Ser and Asp is favored for the penetration of the blood-brain 
barrier, a requirement for the design of a memapsin 2 inhibitor drug for treating 
Alzheimer's disease. Therefore, inhibitors were designed to place a transition- 
state analogue isostere between Leu and Ala (shown as Leu* Ala, where * 
represents the transition-state isostere, -CH(0H)-CH2-) and the subsite P4, P3, 

15 P2, P2*, P3' and P4' are filled with the beta-secretase site sequence of the 
Swedish mutant from the beta-amyloid protein. The structures of inhibitors 
OM99-1 and OM99-2 are shown below and in Figures 3 A and 3B, respectively: 

OM99-1 : Val-Asn-Leu*Ala-Ala-Glu-Phe (SEQ. ID NO. 27) 

20 OM99-2: Glu-Val-Asn-Leu*Ala-Ala-Glu-Phe (SEQ. ID NO. 28) 

The Leu'^Ala dipeptide isostere was synthesized as follows: 
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The Leu-Ala dipeptide isostere for the M2-inhibitor was prepared from 
L-leucine. As shown in Scheme 1, L-leucine was protected as its BOC- 
derivative 2 by treatment with BOC2O in the presence of 10% NaOH in diethyl 
ether for 12 h. Boc-leucine 2 was then converted to Weinreb amide 3 by 
treatment with isobutyl chcloroformate and N-methylpiperidine followed by 
treatment of the resulting mixed anhydride with N,0-dimethylhydroxylamine 



o 



1 



OH 



(Boc)20, 10% NaOH 



Et20, 97% 



1 



/BuOCOCI 
N-methyl piperidine, 
MeONHMe.HCI, 97 



o 

4 



LiAIH4, EtaO 
86% 



LDA, ethyl propiolate 
42% 




OMe 



COjEt 



1. H2, Pd/BaSC^ Nr^^V 

2. AcOH, PhMe ^ 1 \ 



P 



* O 




H 




10.7% 



7 62.8 % 



(Scheme 1) 

(Nahm and Weinreb, Tetrahedron Letters 1981, 32, 3815), Reduction of 3 with 
lithium aluminum hydride in diethyl ether provided the aldehyde 4. Reaction of 
the aldehyde 4 with lithium propiolate derived from the treatment of ethyl 



1251728 



37 



OMRF 182 
20487/252 



propiolate and lithium diisopropylamide afforded the acetylenic alcohol 5 as an 
inseparable mixture of diastereomers (5.8:1) in 42% isolated yield (Fray, Kaye 
and Kleinman, J. Org. Chem. 1986, 51, 4828-33). Catalytic hydrogenation of 5 
over Pd/BaS04 followed by acid-catalyzed lactonization of the resulting 
gamma-hydroxy ester with a catalytic amount of acetic acid in toluene at reflux, 
furnished the g£imma-lactone 6 and 7 in 73% yield. The isomers were separated 
by silica gel chiomatography by using 40% ethyl acetate in hexane as the eluent. 



LHMDS, Me!, 



THF 
70-76 % 




OH 



1. TFA:DCM 
(1:4).0°C 

2. Fmoc-OSu, 
Aq. NaHCO 3 

dioxane 




1. Aq. LiOH 

2. TBDMSCi, 
imidazole, 

DMF 



OH 



(Scheme 2) 

Introduction of the methyl group at C-2 was accomplished by stereoselective 
alkylation of 7 with methyl iodide (Scheme 2). Thus, generation of the dianion 

10 of lactone 7 with lithium hexamethyldisilazide (2.2 equivalents) in 

tetrahydrofuraiiL at -78°C (30 min) and alkylation with methyl iodide (1.1 
equivalents) for 30 min at -78°C, followed by quenching with propionic acid (5 
equivalents), provided the desired alkylated lactone 8 (76% yield) along with a 
small amoimt (less than 5%) of the corresponding epimer (Ghosh and Fidanze, 

15 1998 J. Org. Chem. 1998, 63, 6146-54). The epimeric cis-lactone was removed 
by colimm chromatography over silica gel using a mixture (3: 1) of ethyl acetate 
and hexane as the solvent system. The stereochemical assignment of alkylated 

lactone 8 was made based on extensive 1H-NMRN0E experiments. Aqueous 
lithium hydroxide promoted hydrolysis of the lactone 8 followed by protection 
20 of the gamma-hydroxyl group with /er^-butyldimethylsilyl chloride in the 
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presence of imidazole and dimethylaminopyridine in dimethylformamide 
afforded the acid 9 in 90% yield after standard work-up and chromatography. 
Selective removal of the BOC-group was effected by treatment with 
trifluoroacetic acid in dichloromethane at O^'C for 1 h. The resulting amine salt 
5 was then reacted with commercial (Aldrich, Milwaukee) Fmoc-succinimide 
derivative in dioxane in the presence of aqueous NaHCOs to provide the Fmoc- 
protected L*A isostere 10 in 65% yield after chromatography. Protected 
isostere 10 was utilized in the preparation of a random sequence inhibitor 
library, 

1 0 Experimental procedure 

N-(tert'Butoxycarhonyl)'L-Leucine (2), 

To the suspension of 10 g (76.2 mmol ) of L-leucine in 140 mL of 
diethyl ether was added 80 mL of 10 % NaOH. After all solid dissolves, 20 mL 
(87.1 mmol) of BOC2O was added to the reaction mixture. The resulting 

1 5 reaction mixture was stirred at 23 °C for 12 h. After this period, the layers were 
separated and the aqueous layer was acidified to pH 1 by carefiil addition of 1 N 
aqueous HCl at 0 ""C. The resulting mixture was extracted with ethyl acetate (3 x 
100 mL). The organic layers were combined and washed with brine and dried 
over anhydrous Na2S04. The solvent was removed under reduced pressure to 

20 provide title product which was used directly for next reaction without further 
purification (yield, 97 %). NMR (400 MHz, CDCI3) 5 4.89 (broad d, IH, J 
- 8.3 Hz), 4.31 (m, IH), 1.74-1.49 (m, 3H), 1.44 (s, 9H), 0.95 (d, 6H, J - 6.5 
Hz). 

N-(tert'Butoxycarbonyl)'L-leucme-N'-methoxy-N'-methyIa^ (3). 

25 To a stirred solution of N,0-dimethylhydroxy amine hydrochloride (5 .52 

g, 56.6 mmol) in dry dichloromethane (25 mL) under N2 atmosphere at O'^C, - 
methylpiperidine (6.9 mL, 56.6 mmol) was added dropwise. The resulting 
mixture was stirred at O'^C for 30 min. In a separate flask, N-(tert- 
butyloxycarbonyl)-L-leucine (1) (11 .9 g, 5 1 .4 mmol) was dissolved in a mixture 

30 of THF (45 mL) and dichloromethane (1 80 mL) under N2 atmosphere. The 
resulting solution was cooled to -20°C. To this solution was added 1- 
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methylpiperidme (6.9 mL, 56.6 mmol) followed by isobutyl chloroformate (7.3 
mL, 56.6 mmol). The resulting mixture was stirred for 5 minutes at -20°C and 
the above solution of N,0-dimethylhydroxy amine was added to it. The reaction 
mixure was kept -20 for 30 minutes and then warmed to 23''C. The reaction 

5 was quenched with water and the layers were seperated. The aqueous layer was 
extracted with dichloromethane (3 x 100 mL). The combined organic layers 
were washed with 10% citric acid, saturated sodium bicarbonate, and brine. The 
organic layer was dried over anhydrous Na2S04 and concentrated under the 
reduced pressure. The residue was purified by flash silica gel chromatography 

10 (25% ethyl acetate/hexane) to yield the title compound 3 (13.8 g, 97%) as a pale 
yellow oil. NMR (400 MHz, CDCI3) 6 5.06 (broad d, IH, J - 9.1 Hz), 4.70 
(m, IH), 3.82 (s, 3H), 3.13 (s, 3H), 1,70 (m, IH), 1.464.36 (m, 2H) 1.41 (s, 
9H), 0.93 (dd, 6H, J - 6.5, 14.2 Hz). 
N'(tert'Butoxycarbonyl)'L-leucinal (4), 

15 To a stiiTed suspension of lithium aluminum hydride (770 mg, 20.3 

mmol) in dry diethyl ether (60 mL) at -40 under N2 atmosphere, was added 
N-tert-butyloxycarbonyl-L-leucine-N'-methoxy-N'-methylamide (5.05 g, 18.4 
mmol) in diethyl ether (20 mL). The resulting reaction mixture was stirred for 
30 min. After this period, the reaction was quenched with 10% NaHS04 

20 solution (30 mL). The resulting reaction mixture was then warmed to 23°C and 
stirred at that temperature for 30 min. The resulting solution was filtered and the 
filter cake was washed by two portions of diethyl ether. The combined organic 
layers were washed with saturated sodium bicarbonate, brine and dried over 
anhydrous MgS04. Evaporation of the solvent under reduced pressure afforded 

25 the title aldehyde 4 (3.41 g) as a pale yellow oil. The resulting aldehyde was 
used immediately without further purification. ^H NMR (400 MHz, CDCI3) 8 
9.5 (s, IH), 4.9 (s, IH), 4.2 (broad m, IH), 1.8-1.6 (m, 2H), 1.44 (s, 9H), 1.49- 
1.39 (m, IH), 0.96 (dd, 6H, J = 2.7, 6.5 Hz). 

Ethyl (4S,5S)'and (4R,5S)'5-[(tert'Butoxycarbonyl)amino]-4-hydroxy-7- 
30 methyloct-2'ynoate (5), 
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To a stirred solution of diisopropylamine (1.1 mL, 7.9 mmol) in dry THF 
(60 mL) at 0°C under N2 atmosphere, was added n-BuLi (1.6 M in hexane, 4.95 
mL, 7.9 mmol) dropwise. The resulting solution was stirred at O^C for 5 min 
and then warmed to 23 °C and stirred for 15 min. The mixture was cooled to - 
5 78*^C and ethyl propiolate (801 |aL) in THF (2 mL) was added dropwise over a 
period of 5 min. The mixture was stirred for 30 min, after which N-Boc-L- 
leucinal 4 (L55 g, 7.2 mmol) in 8 mL of dry THF was added. The resulting 
mixture was stiirred at -78''C for 1 h. After this period, the reaction was 
quenched with acetic acid (5 mL) in THF (20 mL). The reaction mixure was 

10 warmed up to 23°C and brine solution was added. The layers were separated 
and the organic layer was washed with saturated sodium bicarbonate and dried 
over Na2S04. Evaporation of the solvent under reduced pressure provided a 
residue which was purified by flash silica gel chromatography (15 % ethyl 
acetate / hexane) to afford a mixture (3:1) of acetylenic alcohols 5 (0.96 g, 42 

15 %), ^H NMR (300 MHz, CDCI3) 6 4,64 (d, IH, J = 9.0 Hz), 4,44 (broad s, IH), 
4.18 (m, 2H ), 3.76 (m, IH), 1.63 (m, IH), 1.43-1.31 (m, 2H), 1.39 (s, 9H), 1.29- 
1.18 (m, 3H), 0.89 (m, 6H). 

(5SJ'S)-5-[l ''[(tert'Butoxycarbonyl)amino]-3 '-methylbutylj 
-dihydrofuran'2(3H)-one (7). 

20 To a stiirred solution of the above mixture of acetylenic alcohols (1 .73 g, 

5,5 mmol ) in ethyl acetate (20 mL) was added 5% Pd/BaS04 (1 g). The 
resulting mixture was hydrogenated at 50 psi for 1 .5 h. After this period, the 
catalyst was filtered off through a plug of Celite and the filtrate was 
concentrated under reduced pressure. The residue was dissolved in toluene (20 

25 mL) and acetic acid (100 \\L), The reaction mixure was refluxed for 6 h. After 
this period, the reaction was cooled to 23 ""C and the solvent was evaporated to 
give a residue which was purified by flash silica gel chromatography (40% 
diethyl ether / hexane) to yield the (5S, lS')-gamma-lactone 7 (0.94 g, 62.8 and 
the (5R, lS')-gamma-lactone 6 (0.16 g, 10.7 %). Lactone 7: ^H NMR (400 

30 MHz, CDCI3) 5 4,50-4.44 (m, 2H), 3.84-3.82 (m, IH), 2.50 (t, 2H, J = 7.8 Hz), 
2.22-2.10 (m, 2H), L64-1.31 (m, 3H), 1.41 (s, 9H), 0.91 (dd, 6H, J - 2.2, 6.7 
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Hz); ^^C NMR (75 MHz, CDCI3) 5 177.2, 156.0, 82.5, 79.8, 51.0, 42.2, 28,6, 
28.2, 24.7, 24.2, 23,0,21,9. 

(3RJSJ 'S)-5'[l '-[(tert-Butoxycarbonyl)amino)]-3 '-methylhut-yl] -3 -methyl 
dihydrofuran'2(3H)-one (8). 

5 To a stilted solution of the lactone 7 (45 1 .8 mg, 1 .67 mmol) in dry THF 

(8 mL) at -78*^C under N2 atmosphere, was added lithium hexamethyldisilazane 
(3.67 mL, 1,0 M in THF) over a period of 3 min. The resulting mixture was 
stirred at -78^C for 30 min to generate the lithium enolate. After this period, 
Mel (228 laL) v^as added dropwise and the resuhing mixture was stirred at - 

1 0 78°C for 20 min. The reaction was quenched with saturated aqueous NH4CI 
solution and was allowed to warm to 23°C. The reaction mixture was 
concentrated under reduced pressure and the residue was extracted with ethyl 
acetate (3 x 100 mL). The combined organic layers were washed with brine and 
dried over anhydrous Na2S04. Evaporation of the solvent afforded a residue 

1 5 which was purified by silica gel chromatography (1 5 % ethyl acetate / hexane) 
to furnish the alkylated lactone 8 (0,36 g, 76 %) as an amorphous solid. ^H 
NMR (300 MHz, CDCI3) 5 4,43 (broad t, IH, J - 6.3 Hz), 4.33 (d, IH, J - 9.6 
Hz), 3.78 (m, IH), 2.62 (m, IH), 2.35 (m, IH), 1,86 (m , IH), 1,63-1.24 (m, 
3H), 1.37 (s, 9H), 1.21 (d, 3H, J = 7.5 Hz), 0.87 (dd, 6H, J = 2.6, 6,7 Hz); '^C 

20 NMR (75 MHz, CDCI3) 8 180,4, 156.0, 80,3, 79.8, 51,6, 41.9, 34.3, 32.5, 28.3, 
24,7, 23.0,21,8,16,6. 

(2R, 4Sy 5S)-5-[ (tert-Butoxycarbonyl) amino ]-4-[ (tert-butyldimeth 
'ylsilyl)oxy J-2, 7-dimethyloctanoic acid (9), 

To a stirred solution of lactone 8 (0.33 g, 1.17 mmol ) in THF (2 mL) 

25 was added 1 N aqueous LiOH solution (5.8 mL), The resulting mixture was 

stirred at 23^C for 10 h. After this period, the reaction mixture was concentrated 
under reduced pressure and the remaining aqueous residue was cooled to O^C 
and acidified v/ith 25% citric acid solution to pH 4, The resulting acidic 
solution was extracted with ethyl acetate (3 x 50 mL). The combined organic 

30 layers were washed with brine, dried over Na2S04 and concentrated to yield the 
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corresponding hydroxy acid (330 mg) as a white foam. This hydroxy acid was 
used directly for the next reaction without further purification. 

To the above hydroxy acid (330 mg, LI mmol) in anhydrous DMF was 
added imidazole (1.59 g, 23.34 mmol) and tert-butyldimethylchlorosilane (1.76 

5 g, 1 1 .67 mmol). The resulting mixture was stirred at 23''C for 24 h. After this 
period, MeOH (4 mL) was added and the mixture was stirred for 1 h. The 
mixure was diluted with 25% citric acid (20 mL) and was extracted with ethyl 
acetate (3 x 20 mL). The combined extracts were washed with water, brine and 
dried over anhydrous Na2S04. Evaporation of the solvent gave a viscous oil 

10 which was purified by flash chromatography over silica gel (35% ethyl acetate / 
hexane) to afford the silyl protected acid 9 (0.44 g, 90 %). IR (neat) 3300-3000 
(broad ), 2955, 2932, 2859, 1711 cm"^; NMR (400 MHz, DMSO-d^ 343 K) 
delta 6.20 (bro£id s, 1 H), 3.68 (m, IH), 3.51 (broad s, IH), 2.49-2.42 (m, IH), 
L83 (t, IH, J - 10.1 Hz), 1.56 (m, IH), 1.37 (s, 9H), 1.28-1.12 (m, 3H), L08 (d, 

15 3H, J = 7.1 Hz), 0.87 (d, 3H, J - 6.1 Hz) 0.86 (s, 9 H), 0.82 (d, 3H, J = 6.5 Hz), 
0.084 (s, 3H), 0.052 (s, 3H). 

(2R, 4S, 5S)-5-[ (fluorenylmethyloxycarbonyl) amino ]-4-[ (tert-butyldi-methyl 
silyl)oxy]-2, 7'dimethyloctanoic acid (10). 

To a stirred solution of the acid 9 (0.17 g , 0.41 mmol) in 

20 dichloromethane (2 mL) at O^'C was added trifluoroacetic acid (500 i^L). The 
resulting mixture was stirred at O^^C for 1 h and an additional portion (500 ^iL) 
of trifluoroacetic acid was added to the reaction mixture. The mixture was 
stirred for an additional 30 min and the progress of the reaction was monitored 
by TLC. After this period, the solvents were carefiilly removed imder reduced 

25 pressure at a bath temperature not exceeding 5*^C. The residue was dissolved in 
dioxane (3 mL) and NaHCOs (300 mg) in 5 mL of H2O, To this solution was 
added Fmoc-succinimide (166.5 mg, 0.49 mmol) in 5 mL of dioxane. The 
resulting mixture was stirred at 23''C for 8 h. The mixure was then diluted with 
H2O (5 mL) and acidified with 25% aqueous citric acid to pH 4. The acidic 

30 solution was extracted with ethyl acetate (3 x 50 mL). The combined extracts 
were washed with brine, dried over Na2S04 and concentrated under reduced 
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pressure to give a viscous oil residue. Purification of the residue by flash 
chromatography over silica gel afforded the Fmoc-protected acid 10 (137 mg, 
61%) as a white foam. NMR (400 MHz, DMSO-d^ 343 K) 6 7.84 (d, 2H, J 
= 7.4 Hz), 7.66 (d, 2H, J = 8 Hz), 7.39 (t, 2H, J - 7.4 Hz), 7.29 (m, 2H), 6.8 (s, 

5 IH), 4.29-4.19 (m, 3H), 3.74-3.59 (m, 2H), 2.49 (m, IH), 1.88 (m, IH), 1.58 (m, 
IH), L31-1.17 (m, 3H), 1.10 (d, 3H, J = 7.1 Hz), 0.88 (s, 9H), 0.82 (d, 6H, J - 
6.2 Hz), 0.089 (s, 3 H), 0.057 (s, 3H). 

The synthesis of OM99-1 and OM99-2 were accomplished using solid 
state peptide synthesis procedure in which Leu* Ala was incorporated in the 

10 fourth step. The synthesized inhibitors were purified by reverse phase HPLC and 
their structure confirmed by mass spectrometry. 
Example 8. Inhibition of Meraapsin 2 by OM99-1 and OM99-2. 

Enzyme activity was measured as described above, but with the addition 
of either OM99-1 or OM99-2. OM99-1 inhibited recombinant memapsin 2 as 

15 shown in Figure 5 A. The Ki calculated is 3 x 10"^ M. The substrate used was a 
synthetic fluorogenic peptide substrate. The inhibition of OM99-2 on 
recombinant memapsin 2 was measured using the same fluorogenic substrate. 
The Ki value was determined to be 9.58 x 10'^ M, as shown in Figure 5B. 
These results demonstrate that the predicted subsite specificity is 

20 accurate and that inhibitors can be designed based on the predicted specificity. 

The residues in PI and PT are very important since the M2 inhibitor 
must penetrate the blood-brain barrier (BBB). The choice of Ala in PT 
facilitates the penetration of BBB. Analogues of Ala side chains will also work. 
For example, in addition to the methyl side chain of Ala, substituted methyl 

25 groups and groups about the same size like methyl or ethyl groups can be 

substituted for the Ala side chain. Leu at PI can also be substituted by groups of 
similar sizes or with substitutions on Leu side chain. For penetrating the BBB, it 
is desirable to make the inhibitors smaller. One can therefore use OM99-1 as a 
starting point and discard the outside subsites P4, P3, P3' and P4' . The retained 

30 structure Asn-Leu*Ala-Ala (SEQ ID NO:29) is then further evolved with 

substitutions for a tight-binding M2 inhibitor which can also penetrate the BBB. 
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Example 9, Crystallization and X-ray diffraction study of the protease 
domain of hvim;m memapsin 2 complexed to a specifically designed inhibitor, 
OM99-2, 

The crystallization condition and preliminary x-ray diffraction data on 
5 recombinant human memapsin 2 complexed to OM99-2 were determined. 
Production of Recombinant Memapsin 2 

About 50 mg of recombinant memapsin 2 was purified as described in 
Example 3. For optimal crystal growth, memapsin 2 must be highly purified. 
Memapsin 2 was over-expressed from vector pETl la-M2pd. This memapsin 2 

10 is the zymogen domain which includes the pro and catalytic domains to the end 
of the C-terminal extension but does not include the transmembrane and the 
intracellular domains. The vector was transfected into E. coli BL21 (DE3) and 
plated onto ZB agar containing 50 mg/liter ampicillin. A single colony was 
picked to inoculate 100 ml of liquid ZB containing 5 mg ampicillin and cultured 

15 at 30 ''C, for 18 hours, with shaking at 220 RPM. Aliquots of approximately 15 
ml of the overnight culture were used to inoculate each 1 liter of LB containing 
50 mg of ampicillin. Cultures were grown at 37 ^C, with shaking at 180 RPM, 
until an optical density at 600 nm near 0,8 was attained. At that time, expression 
was induced by addition of 1 19 mg of IPTG to each liter of culture. Incubation 

20 was continued for 3 additional hours post-induction. 

Bacteria were harvested, suspended in 50 mM Tris, 150 mM NaCl, pH 
7.5 (TN buffer), and lysed by incubation with 6 mg lysozyme for 30 minutes, 
followed by fi'eezing for 18 hours at -20 ^C. Lysate was thawed and made to 1 
mM MgCli then 1000 Kunitz units of DNAse were added with stirring, and 

25 incubated for 30 min. Volume was expanded to 500 ml with TN containing 0. 1 
% Triton X-100 (TNT buffer) and lysate stirred for 30 minutes. Insoluble 
inclusion bodies containing greater than 90% memapsin 2 protein were pelleted 
by centrifugation, and washed by resuspension in TNT with stirring for 1-2 
hours. Following three additional TNT washes, the memapsin 2 inclusion 

30 bodies were dissolved in 40 ml of 8 M urea, I mM EDTA, 1 mM glycine, 100 
mM Tris base, 100 mM beta-mercaptoethanlol (8 M urea buffer). Optical 
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density at 280 nm was measured, and volume expanded with 8 M urea buffer to 
achieve final O.D. near 0.5, with addition of sufficient quantity of beta- 
mercaptoethanol to attain 10 mM total, and 10 mM DTT, 1 mM reduced 
glutathione, 0.1 mM oxidized glutathione. The pH of the solution was adjusted 
5 to 10.0 or greater, and divided into four aliquots of 200 ml each. Each 200 ml 
was rapidly-diluted into 4 liters of 20 mM Tris base, with rapid stirring. The pH 
was adjusted inmiediately to 9.0, with 1 M HCl, and stored at 4 ^^C overnight. 
The following moming the diluted memapsin 2 solution was maintained at room 
temperature for 4-6 hours followed by adjusting pH to 8.5 and replacing the 

10 flasks to the 4 room. The same procedure was followed the next day with 
adjustment of pH to 8.0. 

This memapsin 2 solution was allowed to stand at 4 for 2-3 weeks. 
The total volume of approximately 16 liters was concentrated to 40 mis using 
ultra-filtration (Millipore) and stir-cells (Amicon), and centrifiiged at 140,000 xg 

15 at 30 minutes in a rotor pre-equilbrated to 4 ^C. The recovered supernatant was 
applied to a 2.5 x 100 cm column of S-300 equilibrated in 0.4 M urea, 20 mM 
Tris-HCl, pH 8.0, and eluted with the same buffer at 30 ml/hour. The active 
fi*action of memapsin 2 was pooled and further purified in a FPLC using a 1 ml 
Resource-Q (Pharmacia) column. Sample was filtered, and applied to the 

20 Resource-Q column equilibrated in 0.4 M urea, 50 mM Tris-HCl, pH 8.0, 

Sample was eluted with a gradient of 0 - 1 M NaCl in the same buffer, over 30 
ml at 2 ml/min. The eluents containing memapsin 2 appeared near 0.4 M NaCl 
which was pooled for crystallization procedure at a concentration near 5 mg/ml 
The amino-terminal sequence of the protein before crystallization 

25 showed two sequences starting respectively at residues 28p and 30p. Apparently, 
the pro peptide of recombinant pro-memapsin 2 had been cleaved during the 
preparation by a yet unidentified proteolytic activity. 

The activation of the folded pro-enzyme to mature enzyme, memapsin 2, 
was carried out as described above, i,e., incubation in 0.1 M sodium acetate pH 

30 4.0 for 16 hours at 22 ^C. Activated enzyme was fiirther purified using anion- 
exchange column chromatography on Resource-Q anion exchange column. The 
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purity of the eii;?yme was demonstrated by SDS-gel electrophoresis. At each 
step of the purification, the specific activity of the enzyme was assayed as 
described above to ensure the activity of the enzyme. 
Preliminarv CrvstaUization with OM99-2 

5 Crystal itrials were performed on purified memapsin 2 in complex with a 

substrate based transition-state inhibitor OM99-2 with a Ki = 10 nM. OM99-2 
is equivalent to eight amino-acid residues (including subsites S4, S3, S2, SI Sl\ 
S2', S3' and S4' in a sequence EVNLAAEF) with the substitution of the peptide 
bond between the SI and ST (L-A) by a transition-state isostere 

10 hydroxyethylene. Purified M2 was concentrated and mixed with 10 fold 
excessive molajr amount of inhibitor. The mixture was incubated at room 
temperature for 2-3 hours to optimize the inhibitor binding. The crystallization 
trial was conducted at 20 using the hanging drop vapor diffusion procedure. 
A systemic seai'ch with various crystallization conditions was conducted to find 

1 5 the optimum crystallization conditions for memapsin 2/OM99-2 inhibitor 

complex. For the first step, a coarse screen aimed at covering a wide range of 
potential conditions were carried out using the Sparse Matrix Crystallization 
Screen Kits pui'chased from Hampton Research. Protein concentration and 
temperature were used as additional variables. Conditions giving promising 

20 (micro) crystals were subsequently used as starting points for optimization, 
using fine grids of pH, precipitants concentration etc. 

Crystals of memapsin-inhibitor complex were obtained at 30% PEG 
8000, 0.1 M NaCocadylate, pH 6.4. SDS gel electrophoresis of a dissolved 
crystal verified that the content of the crystal to be memapsin 2. Several single 

25 crystals (with the sizes about 0.3 mm x 0.2 mm x 0.1 mm) were carefiilly 

removed from the cluster for data collection on a Raxis IV image plate. These 
results showed that the crystals diffract to 2.6 A. A typical protein diffraction 
pattern is shown in Figure 6. An X-ray image visualization and integration 
software-Denzo, was used to visualize and index the diffraction data. Denzo 

30 identified that llie primitive orthorhombic lattice has the highest symmetry with 
a significantly low distortion index. The unit cell parameters were determined 
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as: a=89.1 A, b==96,6 A, c-134.1 A, a-p=y=90°. There are two memapsin 
2/OM99-2 complexes per crystallographic asymmetric unit, the Vm of the crystal 
is 2.9 A^/Da, Diffraction extinctions suggested that the space group is P2i2i2i. 
With diffraction of the current crystal to 2.6 A, the crystal structure 

5 obtained from these data has the potential to reach atomic solution, i.e., the 
three-dimensional positions of atoms and chemical bonds in the inhibitor and in 
memapsin 2 can be deduced. Since memapsin 2 sequence is homologous with 
other mammalian aspartic proteases, e.g., pepsin or cathepsin D, it is predicted 
that the three dimensional structures of memapsin 2 will be similar (but not 

1 0 identical) to their structures. Therefore, in the determination of x-ray structure 
from the diffraction data obtained from the current crystal, it is likely the 
solution of the phase can be obtained from the molecular replacement method 
using the known crystal structure of aspartic proteases as the search model 
Fixrther Crystallization Studies 

1 5 Concentrated memapsin 2 was mixed with 1 0-fold molar excessive of 

the inhibitor. The mixture was incubated at room temperature for 2-3 hours to 
optimize inhibitor binding, and then clarified with a 0.2 micron filter using 
centrifugation. Crystals of memapsin 2-inhibitor complex were grown at 20 ''C 
by hanging drop vapor diffusion method using equal volumes of enzyme- 

20 inhibitor and well solution. Crystals of quality suitable for diffraction studies 
were obtained in two weeks in 0.1 M sodium cacodylate, pH 7.4, 0.2 M 
(NH4)2S04, and 22.5% PEG8000. The typical size of the crystals was about 0.4 
X 0.4 X 0.2 mmil 

Diffraction data were measured on a Raxis-IV image plate with a 

25 Rigaku X-ray generator, processed with the HKL program package [Z. 

Otwinowski, W. Minor, Methods EnzymoL 276, 307 (1997)] A single crystal 
of approximately 0.4 x 0.4 x 0.2 mm^ in size was treated with a cryo-protection 
solution of 25% PEG8000, 20% glycerol, 0.1 M sodium-cacodylate pH 6.6, and 
0.2 M (NH4)2S04, and then flash-cooled with liquid nitrogen to about -180 ""C 

30 for data collection. Diffraction was observed to at least 1 .9 A. The crystal form 
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belongs to space group P2i with two memapsin 2/OM99-2 complexes per 
crystallographic asymmetric unit and 56% solvent content. 

Molecular replacement was performed with data in the range of 15.0- 
3.5 A using program AmoRe, CCP4 package [Navaza, J., Acta Crystallog. 
5 Sect. A. 50, 157 (1994)]. Pepsin, a human aspartic protease with 22% sequence 
identity, was used as the search model(PDB id Ipsn). Rotation and translation 
search, followed by rigid body refinement, identified a top solution and 
positioned both molecules in the asymmetric unit. The initial solution had a 
correlation coefficient of 22% and an R-factor of 0.5 1 . The refinement was 

10 carried out using the program CNS [Brunger et al, Acta Crystallogr. Sect. D, 
54, 905 (1998)]. 10% of reflections were randomly selected prior to refinement 
for Rfree monitoring [Bruger, A.T., X-PLOR Version 3.1: A system for X-ray 
Crystallography and NMR, Yale University Press, New Haven, CT (1992)]. 
Molecular graphics program [Jones, T.A., et al, Improved methods for 

1 5 building protein models in electron denisty maps and location of errors in these 
models. Acta Crystallogr, Sect. A 47, 110 (1991)] was used for map display 
and model building. From the initial pepsin model, corresponding amino acid 
residues were changed to that of memapsin 2 according to sequence alignment. 
The side chain conformations were decided by the initial electron density map 

20 and a rotomer library. This model was refined using molecular dynamics and 
energy minimization function of CNS [Bruger, A.T., et al, Acta Crystallogr. 
Sect. D, 54, 905 (1998)]. The first cycle of refinement dropped the Rworking to 
41%) and the Rfree to 45%. At this stage, electron densities in the omit map 
clearly showed the inhibitor configuration in the active site cleft. Structural 

25 features unique to memapsin 2 in chain tracing, secondary structure, insertions, 
deletions and extensions (as compared to the search model) are identified and 
constructed in subsequent iterations of crystallographic refinement and map 
fitting. The inliibitor was built into the corresponding electron density. 

About 440 solvent molecules were then gradually added to the structure 

30 as identified in the |Fol-|Fcl map contoured at the 3 sigma level. Non- 

crystallographic symmetry restriction and averaging were used in early stages 
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of refinement and model building. Bulk solvent and anisotropic over-all B 
factor corrections were applied through the refinement. The final structure was 
validated by the program PROCHECK Laskowski, R. A. et al, J. AppL 
Crystallog. 26, 283 (1993) which showed that 95% of the residues are located 
in the most favored region of the Ramachandran plot. All the main chain and 
side chain parameters are within or better than the standard criteria. The final 
Rworking ^nd Rfree are 18% and 22% respectively. Refinement statistics are listed 
in Table 2. 
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Table 2. Data Collection and Refinement Statistics 



A. Data Statistics 

Space group P2i 

5 Unit Cell (a, b, and c in ) 53.7,85.9,109.2 

(, ,and in degrees) 90.0,101.4,90.0 

Resolution ( ) 25.0-1.9 

Number of observed reflections 1 44, 1 64 

Number of unique reflections 69,056 

10 Rmerge' 0.061 (0.25) 

Data completeness (%) (25.0-1 .9 ) 90.0 (68.5) 

<!/ (I)> 13.7(3.0) 

B. Refinement Statistics 

Rworking^ 0.186 

15 Rfree*' 0.228 

RMS deviation from ideal values 

Bond length ( ) 0.014 

Bond angle (Deg) 1.7 

Number of water molecules 445 
20 Average B-factor ( ^) 

Protein 28.5 

Solvent 32.2 



^ Rmerge = hki i | Ihki, i " <Ihki> | / hki <Ihki>, where Ihki, i is the intensity of the 
25 ith measurement and <Ihki> is the weighted mean of all measurements of Ihki- 
^ Rworking (free) ^ I Fo| " I Fc| / | Fo], whcre Fo and Fc are the observed and 
calculated structure factors. Numbers in parentheses are the corresponding 
numbers for the highest resolution shell (2.00-1 .9 A). Reflections with 
Fo/ (Fo)>=0.0 are included in the refinement and R factor calculation. 
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Memapsin 2 Crystal Structure. 

The bilobal structure of memapsin 2 (Figure 7) is characteristic of 
aspartic proteases (Tang, J., et aL, Nature 271, 618-621 (1978)) with the 
conserved folding of the globular core. The substrate binding cleft, where the 
5 inhibitor is bound (Figure 7), is located between the two lobes, A pseudo two- 
fold symmetry between the N- (residues 1-180) and C- (residues 181-385) 
lobes (Figure 7), which share 61 superimposable atoms with an overall 2.3 A 
rms deviation using a 4 A cutoff. The corresponding nximbers for pepsin are 67 
atoms and 2.2 A. Active-site Asp^^ and Asp^^^ and the surrounding hydrogen- 

10 bond network aire located in the center of the cleft (Figure 7) and are conserved 
with the typical active-site conformation (Davies, D. R., Annu. Rev. Biophys. 
Chem. 19, 189 (1990)). The active site carboxyls are, however, not co-planar 
and the degree of which (50^) exceeds those observed previously. 

Compai-ed to pepsin, the conformation of the N-lobe is essentially 

15 conserved (Sielecki et al., 1990), The most significant structural differences are 
the insertions and a C-terminal extension in the C-lobe, Four insertions in 
helices and loops (Figure 7) are located on the adjacent molecular surface. 
Insertion F, which contains four acidic residues, is the most negatively charged 
surface on the molecule. Together, these insertions enlarged significantly the 

20 molecular boundary of memapsin 2 as compared to pepsin (Figure 8). These 
surface structujral changes may have fiinction in the association of memapsin 2 
with other cell surface components. Insertions B and E are located on the other 
side of the molecule (Figure 7). The latter contains a beta-strand that paired 
with part of the C-terminal extension G. A six- residue deletion occurs at 

25 position 329 on a loop facing the flap on the opposite side of the active-site 
cleft, resulting in an apparently more accessible cleft. Most of the C-terminal 
extension (residues 359-393) is in highly ordered structure. Residues 369-376 
form a beta structure with 7 hydrogen bonds to strand 293-299, while residues 
378-383 form a helix (Figures 7 and 8). Two disulfide pairs (residues 155/359 

30 and 217/382) imique to memapsin 2 fasten both ends of the extension region to 
the C-lobe. This C-terminal extension is much longer than those observed 
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previously and is conformationally different [Cutfield, S. M., et al.^ Structure 3, 
1261 (1995); Abad-Zapatero, C, et al, Protein Sci. 5, 640 (1996); Symersky, 
J. et ah, Biochemistry 36, 12700 (1997); Yang, J., et al, Acta Crystallogr. D 55, 
625 (1999)]. The last eight residues (386-393) are not seen in the electron 
5 density map; they may form a connecting stem between the globular catalytic 
domain and the membrane anchoring domain. 

Of the 21 putative pro residues only the last six, 43p-48p, are visible in 
the electron density map. The remainders are likely mobile. Pro-memapsin 
expressed in m;mmialian cell culture has an N-terminus position at Glu^^^. 

10 Hov^ever, an Arg-Arg sequence present at residues 43p-44p is a frequent signal 
for pro-protein processing, e.g., in prorenin (Corvol, P. et al., Hypertension 5, 
13-9 (1983)). Recombinant memapsin 2 derived from this cleavage is fully 
active. The mobility of residues 28p-42p suggests that they are not part of the 
structure of mature memapsin 2. 

15 Memapsin 2-OM99-2 Interaction. 

The binding of the eight-residue inhibitor OM99-2 in the active-site 
cleft shares some structural features v^ith other aspartic protease-inhibitor 
complexes [Davies, D.R., Annu. Rev. Biophys. Chem. 19, 189 (1990); Bailey 
and Cooper, (1994); DeaMs et aL, (1994)]. These include four hydrogen bonds 

20 between the two active-site aspartics to the hydroxyl of the transition-state 
isostere, the covering of the flap (residues 69-75) over the central part of the 
inhibitor and ten hydrogen bonds to inhibitor backbone (Figure 9). Most of the 
latter are highly conserved among aspartic proteases [Davies, D. R. Annu. Rev. 
Biophys. Chem. 19, 189 (1990); Bailey and Cooper, (1994); Dealwis et al, 

25 (1 994)] except that hydrogen bonds to Gly and Tyr^^^ are unique to memapsin 
2. These obser/ations illustrate that the manner by which memapsin 2 
transition-state template for substrate peptide backbone and mechanism of 
catalysis are similar to other aspartic proteases. These common features are, 
however, not the decisive factors in the design of specific memapsin 2 

30 inhibitors v^dth high selectivity. 
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The observation important for the design of inhibitor drugs is that the 
memapsin 2 residues in contact with individual inhibitor side chains (Figure 9) 
are quite different from those for other aspartic proteases. These side chain 
contacts are important for the design of tight binding inhibitor with high 
5 selectivity. Five N-terminal residues of OM99-2 are in extended conformation 
and, with the exception of Pi* Ala, all have clearly defined contacts (within 4 A 
of an inhibitor side chain) with enzyme residues in the active-site cleft (Figure 
9). 

The protease S4 subsite is mostly hydrophilic and open to solvent. The 

1 0 position of inhibitor P4 Glu side chain is defined by hydrogen bonds to Gly ^ ^ 
and to P2 Asn (Figure 9) and the nearby sidechains of Arg'^^^ and Arg^^'^, which 
explains why the absence of this residue from OM99-2 cause a 10-fold increase 
in Kj. Likewise, the protease S2 subsite is relatively hydrophilic and open to 
solvent. Inhibitor P2 Asn side chain has hydrogen bonds to P4 Glu and Arg^^^. 

1 5 The relatively small S2 residues Ser^^^ and Ser"^^^ (Gin and Met respectively in 
pepsin) may fit a side chain larger than Asn, Memapsin 2 Si and S3 subsites, 
which consist mostly of hydrophobic residues, have conformations very 
different from pepsin due to the deletion of pepsin helix hH2 (Dealwis, et al., 
(1994)). The inhibitor side chains of P3 Val and Pi Leu are closely packed 

20 against each other and have substantial hydrophobic contacts with the enzyme 
(Figure 9), especially P3 interacts with Tyr^^ and Phe^^^. In the beta- secretase 
site of native APP, the P2 and Pi residues are Lys and Met respectively. 
Swedish mutant APP has Asn and Leu in these positions respectively, resulting 
in a 60-fold increase of kcat/Km over that for native APP and an early onset of 

25 AD described by Mullan, M., et al. [Nat. Genet. 2, 340 (1992)]. The current 
structure suggests that inhibitor P2 Lys would place its positively charge in an 
unfavorable interaction with Arg^"^^ with a loss of hydrogen bond to Arg^^^, 
while Pi Met would have less favorable contact with memapsin 2 than does 
leucine in this site (Figure 10). No close contact with memapsin 2 was seen for 

30 Pi' Ala and an aspartic at this position, as in APP, may be accommodated by 
interacting with Arg^^^. 
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The direction of inhibitor chain turns at P2' and leads P3' and P4' toward 
the protein surface (Figure 10). As a result, the side-chain position of P2' Ala 
deviates from the regular extended conformation. The side chains of P3' Glu 
and P4' Phe are both pointed toward molecular surface with little significant 
5 interaction with the protease (Figure 10). The relatively high B-factors (58.2 A ^ 
for Glu and 75.6 A ^ for Phe) and less well-defmed electron density suggests 
that these two residues are relatively mobile, in contrast to the defined structure 
of the S3' and S4' subsites in renin-inhibitor (CH-66) complex (Dealwis et al., 
1994). The topologically equivalent region of these renin subsites (residues 292 
10 - 297 in pepsin numbering) is deleted in memapsin 2. These observations 

suggest that the conformation of three C-terminal residues of OM99-2 may be a 
functional feature of memapsin 2, possibly a way to lead a long protein 
substrate out of the active-site cleft. 

Example 10: Using The Crystal Structure to Design Inhibitors. 

15 Pharmaceutically acceptable inhibitor drugs normally post a size limit 

under 800 daltons. In the case of memapsin 2 inhibitors, this requirement may 
even be more s tringent due to the need for the drugs to penetrate the blood-brain 
barrier [Kearney and Aweeka, (1999)]. In the current model, well defined 
subsite structures spending P4 to P2' provide sufficient template areas for rational 

20 design of such drugs. The spacial relationships of individual inhibitor side chain 
vdth the corresponding subsite of the enzyme as revealed in this crystal structure 
permits the design of new inhibitor structures in each of these positions. It is 
also possible to incorporate the unique conformation of subsites P2', P3' and P4' 
into the selectivity of memapsin 2 inhibitors. The examples of inhibitor design 

25 based on the cx^irrent crystal structure are given below. 

Example A: Since the side chains of P3 Val and Pi Leu are packed against 
each other and there is no enzyme structure between them, cross-linking these 
side chains would increase the binding strength of inhibitor to memaspin 2. 
This is because when binding to the enzyme, the cross-linked inhibitors would 

30 have less entropy difference between the free and bound forms than their non- 
cross-linked counterparts [Khan, A.R., et al., Biochemistry, 37, 16839 
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(1998)]. Possible structures of the cross-linked side chains include those shown 
in Figure 1 1 . 

Example B: The same situation exits between the P4 Glu and P2 Asn. The 
current crystal structure shows that these side chains are already hydrogen 
5 bonded to each other so the cross linking between them would also derive 
binding benefit as described in the Example A. The cross-linked structures 
include those shown in Figure 12. 

Example C: Based on the current crystal structure, the P T Ala side chain may 
be extended to add new hydrophobic, Van der Waals and H-bond interactions. 

1 0 An example of such a design is diagramed in Figure 1 3 . 

Example D: Based on the current crystal structure, the polypeptide backbone 
in the region of PI, P2, and P3, and the side chain of PI -Leu can be bridged 
into rings by the addition of two atoms (A and B in Figure 14). Also, a methyl 
group can be added to the beta-carbon of the PI -Leu (Figure 14). 

1 5 Modifications and variations of the methods and materials described 

herein will be obvious to those skilled in the art and are intended to come within 
the scope of the appended claims. 
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We claim. 

1 . An inhibitor of catalytically active memapsin 2 which binds to 
the active site of the memapsin 2 defined by the presence of tv^o catalytic 
aspartic residues and substrate binding cleft. 
5 2. The inhibitor of claim 1 comprising an isostere of the active site 

of memapsin 2. 

3 . The inhibitor of claim 2 comprising a molecule having the 
general form X- L4-P4- L3-P3-L2-P2-Li-Pi-Lo-Pi'-Li'-P2'-L2'-P3'-L3'-P4'L4'-Y, 
wherein Px represent the substrate specificity position relative to the 
10 cleavage site wtiich is represented by an -L0-, and Lx represent the linking 
regions between each substrate specificity position, Px, and 

wherein Lo is a non-hydrolyzable bond and PI' is -R1CR3-, wherein Ri is 
a group smaller than CH2OH (side chain of serine), and at least two other P 
positions are a hydrophobic group. 
1 5 4. The inhibitor of claim 3 which is OM99- 1 . 



7 
8 

20 9 



-7 



5. The inhibitor of claim 3 which is OM99-2. 

6. The inhibitor of claim 3 having the structure of Figure 1 1 . 
The inhibitor of claim 3 having the structure of Figure 12, 
The inhibitor of claim 3 having the structure of Figure 13. 
The inhibitor of claim 3 having the structure of Figure 14, 

10. The inhibitor of claim 1 having an K, of less than or equal to 1 0 
M. 

1 1 . The inhibitor of claim 1 which binds to crystallized enzyme 
characterized by the parameters in Table 2 when bound to OM-99-2. 

25 1 2. The inhibitor of claim 1 1 having a Ki of less than or equal to 1 0"^ 

M. 

1 3 . The inhibitor of claim 1 1 having a Ki of less than or equal to 2 

nM. 

14. The inhibitor of claim 1 3 having a Ki of less than or equal to 1 

30 nM. 
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15. The inhibitor of claim 1 1 having a root mean square difference of 
less than or equal to 0.5 A for the side chain and backbone atoms for amino 
acids 18-379 of memapsin 2. 

16. The inhibitor of claim 1 which is permeable to the blood brain 

5 barrier. 

17. The inhibitor of claim 1 which blocks cleavage by memapsin 2 
under physiological conditions. 

18. The inhibitor of claim 1 which is a non-amino acid small 
molecule. 

10 19. The inhibitor of claim 1 8 having a molecular weight of less than 

800 Daltons. 

20. A method of synthesis of a Leu* Ala dipeptide isostere. 

21. A method for treating a patient to decrease the likelihood of 
developing or the progression of Alzheimer's disease comprising administering 

15 to the individual an effective amount of an inhibitor of memapsin 2 having an Ki 
of less than or equal to 10'"^ M or which binds to crystallized enzyme 
characterized by the parameters in Table 2 when bound to OM-99-2. 

22. The method of claim 21 wherein the inhibitor is administered 

orally. 

20 23 . The method of claim 21 wherein the inhibitor blocks cleavage of 

APP, 
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INHIBITORS OF MEMAPSIN 2 AND USE THEREOF 

Abstract 

Methods for the production of purified, catalytically active, recombinant 
memapsin 2 have been developed. The substrate and subsite specificity of the 

5 catalytically active enzyme have been determined. The substrate and subsite 
specificity inforaiation was used to design substrate analogs of the natural 
memapsin 2 substrate that can inhibit the function of memapsin 2. The substrate 
analogs are based on peptide sequences, shown to be related to the natural 
peptide substrates for memapsin 2. The substrate analogs contain at least one 

1 0 analog of an amide bond which is not capable of being cleaved by memapsin 2 . 
Processes for the synthesis of two substrate analogues including isosteres at the 
sites of the critical amino acid residues were developed and the substrate 
analogues, OMR99-1 and OM99-2, were synthesized. OM99-2 is based on an 
octapeptide Glu-Val-Asn-Leu-Ala-Ala-Glu-Phe (SEQ ID NO:28) with the Leu- 

1 5 Ala peptide bond substituted by a transition-state isostere hydroxyethylene 
group (Figure 1). The inhibition constant of OM99-2 isl.6 x 10"^ M against 
recombinant pro-memapsin 2, Crystallography of memapsin 2 bound to this 
inhibitor was used to determine the three dimensional structure of the protein, as 
well as the importance of the various residues in binding. This information can 

20 be used by those skilled in the art to design new inhibitors, using commercially 
available software programs and techniques familiar to those in organic 
chemistry and enzymology, to design new inhibitors to memapsin 2, useful in 
diagnostics and for the treatment and/or prevention of Alzheimer's disease. 
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I hereby claim foreign priority benefits under 35 U.S.C. 119(a)-(d) or 365(b) of any foreign application(s) for patent or inventor's 
certificate, or 365(a) of any PCT international application which designated at least one country other than the United States of 
America, listed below and have also identified below, by checking the box, any foreign application for patent or inventor's certificate, 
or of any PCT international application having a filing date before that of the application on which priority is claimed. 



Prior Foreign Application 
Number(s) 



Country 



Foreign Filing Date 
(MIVI/DD/YYYY) 



Priority 
Not Claimed 



Certified Copy Attached? 
YES NO 



□ 
□ 
□ 
□ 



□ 
□ 
□ 
□ 



□ 
□ 
□ 
□ 



D Additional foreign application numbers are listed on a supplemental priority data sheet PT0/SB/Q2B attached hereto: 



Application Numbeir(s) 


Filing Date (MM/DD/YYYY) 




60/141,363 
60/168,060 
60,177,836 


June 28, 1999 
November 30, 1999 
January 25, 2000 


Hn Additional provisional application 
numbers are listed on a 
supplemental priority data sheet 
PTO/SB/02B attached hereto. 


60/178,368 


January 27, 2000 





+ 
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OMRF 182 



Please type a plus sign (+) inside this box 



PTO/SB/01 (12-97) 
Approved for use tinrougli 9/30/00. 0MB 0651 -0032 
Patent and Trademark Office; U.S. DEPARTMENT OF COMMERCE 
Under the Papenvorl< Reduction Act of 1 995, no persons are required to respond to a collection of information unless it contains 
a valid 0MB control number. 



DECLARATION — Utility or Design Patent Application 



1 hereby claim the benefit under 35 U.S.C. 120 of any United States application (s), or 365(c) of any PCT international application designating the 
United States of America, listed below and, insofar as the subject matter of each of the claims of this application is not disclosed in the prior 
United States or PCT International application in the manner provided by the first paragraph of 35 U.S.C. 1 12, t acknowledge the duty to disclose 
information which is material to patentability as defined in 37 CFR 1 .56 which became available between the filing date of the prior application 
and the national or PCT international filing date of this application. 



U.S. Parent Application or PCT Parent 
Number 



Parent Filing Date 
(MM/DD/YYYY) 



Parent Patent Number 
(if applicable) 



□ Additional U.S. or PCT international application numbers are listed on a supplemental priority data sheet PTO/SB/02B attached hereto. 



As a named inventor, I hereby appoint the following registered practi tioner(s) to prosecute this app lication and to tra nsact all business in th e Patent 
and Trademark Office connected therewith: Q customer Number ' 



OR 



Registered practitioner(s) name/registration number listed below 



Place Customer 
Number Bar Code 



Name 



Registration 
Number 



Name 



Registration 
Number 



Patrea L, Pabst 
Robert A. Hodges 
Kevin W. King 



31,284 
41,074 
42,737 



Felipe J. Farley 
Zhaoyang Li 



38,445 
P-46,872 



□ Additional registered practitioner(s) named on supplemental Registered Practitioner Information sheet PTO/SB/02C attached hereto. 



Direct all correspondence to: □ Customer Number 

or Bar Code Label 



OR H Correspondence address below 



Name 



Address 



Patrea L. Pabst 



Amall Golden & Gregory, LLP 



Address 



2800 One Atlantic Center, 1201 West Peachtree Street 



City 



Atlanta 



state 



GA 



ZIP 



30309-3450 



Counti^ 



United States 



Telephone 



(404)873-8794 



Fax 



(404)873-8795 



I hereby declare that all statemenis made herein of my own knowledge are true and that ail statements made on information and belief are 
believed to be true; and further that these statements were made with the knowledge that willful false statements and the like so made are 
punishable by fine or imprisonment, or both, under 18 U.S.C. 1001 and that such willful false statements may jeopardize the validity of the 
application or any patent issued the reon. 



Name of Sole or First Inventor: 



□ A petition has been filed for this unsigned inventor 



Given Name (first and middle fit anvl) 



Family Nrimg nr Surnrimft 



Jordan J. N, 



Tang 



Inventor's 
Signature 



Date 



Residence: City 



Edmond 





OK 




state 


Country 



us 



Citizenship 



us 



Post Office Address 



1204 Leawood Drive 



Post Office Address 



City 



Edmond 



State 



OK 



ZIP 



73034 



Country 



US 



□ Additional inventors are being named on the supplemental Additional Inventor(s) sheet(s) PTO/SB/02A attached hereto 
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PTO/SB/02A (3-97) 
Approved for use through 9/30/98. 0MB 0651-0032 | 
Patent and Trademark Office; U.S. DEPARTMENT OF COMMERCE I 
Under the Paperwoik Reduction Act of 1995, no persons are required to respond to a collection of information unless it contains a 
valid 0MB control number. 



DECLARATION 



ADDITIONAL INVENTOR(S) 
Supplemental Sheet 

Page of 



Name of Additional Joint inventor, if any: 



n A petition has been filed for this unsigned inventor 



Given Name (first and middle [if any]) 



Family Name or Surname 



Lin 



Hong 



inventor's 
Signature 



Date 



Residence: City 



Oklahoma City 





OK 


state 





Country} 



us 



CH 



Citizensllip 



Post Office Address 



Post Office Address 



City 



Oklahoma City 



state 



OK 



ZIP 



73139 



Country 



US 



Name of Additional Joint Inventor, If any: 



n A petition has been filed for this unsigned inventor 



Given Name (first and middle [if any]) 



Family Name or Surname 



Arun K. 



Ghosh 



Inventor's 
Signature 



Residence: City 



River Forest 



state 



IL 



Country 



US 



Date 



Citizensiiip 



IN 



Post Office Address 



1407 Clinton Place 



Post Office Address 



City 



River Forest 



state 



JL 



ZIP 



60305 



Country 



US 



Name of Additional Joint Inventor, if any: 



n A petition has been filed for this unsigned inventor 



Given Name (first and middle [if any]) 



Family Name or Surname 



Inventor's 
Signature 



Date 



Residence: City 



State 



Country 



Citizenship 



Post Office Address 



Post Office Address 



City 



State 



ZIP 



Country 
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DECLARATION — Supplemental Priority Data Sheet 



Additional foreign appiications: 



Prior Foreign Application 
Number(s) 



Country 



Foreign Filing Date 
(MM/DD/YYYY) 



Priority 
l^ot Claimed 


Certified Copy Attach 
YES NO 


u 


□ 


□ 


□ 


□ 


□ 


□ 


□ 


Zl 


□ 


□ 


□ 


□ 


□ 


□ 


□ 


□ 


□ 


□ 


□ 


□ 


□ 


□ 


□ 


□ 


□ 


□ 


□ 


□ 


□ 


□ 


□ 


□ 




□ 


□ 


□ 


□ 


□ 


□ 


□ 


□ 


□ 


□ 


□ 



Additional provisional applications: 



Application Number 



Filing Date (IVIM/DD/YYYY) 



60/210,292 



June 8, 2000 



Additional U.S. applications: 



U.S. Parent Application 
Nunnber 



POT Parent 
Number 



Parent Filing Date 
(IVIIVI/DD/YYYY) 



Parent Patent Number 
(if applicable) 
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comments on the amount of time you are required to complete this form should be sent to the Chief Information Officer, Patent and Trademark Office, 
Washington, DC 20231. DO NOT SEND FEES OR COMPLETED FORMS TO THIS ADDRESS. SEND TO: Assistant Commissioner for Patents, 
Washington, DC 20231. 
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SEQUENCE LISTING 

<110> Tang, Jordan J.N. 
Hong, Lin 
Ghosh, Arun K. 

<120> Inhibitors of Memapsin 2 and Use Thereof 

<130> OMRF 182 

<140> Not Yet Assigned 
<141> 2000-06-27 

<150> 60/141,363 
<151> 1999-06-28 

<150> 60/168,060 
<151> 1999-11-30 

<150> 60/177,836 
<151> 2000-01-25 

<150> 60/178,368 
<151> 2000-01-27 

<150> 60/210,292 
<151> 2000-06-08 

<160> 31 

<17 0> Patentin Ver. 2.1 

<210> 1 

<211> 3252 

<212> DNA 

<213> Homo sapiens 

<400> 1 

gcgggagtgc tgcctgccca cggcacccag cacggcatcc ggctgcccct gcgcagcggc 60 
ctggggggcg cccccctggg gctgcggctg ccccgggaga ccgacgaaga gcccgaggag 120 
cccggccgga ggggcagctt tgtggagatg gtggacaacc tgaggggcaa gtcggggcag 180 
ggctactacg tggagatgac cgtgggcagc cccccgcaga cgctcaacat cctggtggat 240 
acaggcagca gtaactttgc agtgggtgct gccccccacc ccttcctgca tcgctactac 3 00 
cagaggcagc tgtccagcac ataccgggac ctccggaagg gtgtgtatgt gccctacacc 360 
cagggcaagt gggaagggga gctgggcacc gacctggtaa gcatccccca tggccccaac 420 
gtcactgtgc gtgccaacat tgctgccatc actgaatcag acaagttctt catcaacggc 480 
tccaactggg aaggcatcct ggggctggcc tatgctgaga ttgccaggcc tgacgactcc 540 
ctggagcctt tctttgactc tctggtaaag cagacccacg ttcccaacct cttctccctg 600 



1 



cagctttgtg gtgctggctt ccccctcaac 
agcatgatca ttggaggtat cgaccactcg 
atccggcggg agtggtatta tgaggtgatc 
ctgaaaatgg actgcaagga gtacaactat 
aaccttcgtt tgcccaagaa agtgtttgaa 
tccacggaga agttccctga tggtttctgg 
ggcaccaccc cttggaacat tttcccagtc 
aaccagtcct tccgcatcac catccttccg 
gccacgtccc aagacgactg ttacaagttt 
atgggagctg ttatcatgga gggcttctac 
ggctttgctg tcagcgcttg ccatgtgcac 
ccttttgtca ccttggacat ggaagactgt 
accctcatga ccatagccta tgtcatggct 
tgcctcatgg tgtgtcagtg gcgctgcctc 
gctgatgaca tctccctgct gaagtgagga 
ggaccacacc tccgtggttc actttggtca 
cagagcacct caggaccctc cccacccacc 
ggctggcaag gtgggttcca gggactgtac 
cactctgctg gcgggaatac tcttggtcac 
tgaaacttca gccctgaacc tttgtccacc 
cttcttttct tagtttcaga agtactggca 
tgtggtaccc tggcagagaa gagaccaagc 
gaggatgcac agtttgctat ttgctttaga 
ggtgcaaaga ttgcctcttg aattaaaaaa 
gcggctggaa agaggagaag gagagggagt 
aggaaaggca gaaacacaac cactcaccag 
catcccatct cagaagatgg gtgttgtttt 
accaaaagtg agatgggaag ggcttatcta 
tgaagtgccc actaagaagt tccacttaac 
tctctatctg aaccaccctt tattctacat 
cctaagctcc aggtgccctg tgggagagca 
tcctggtcat aggctcactc tttcccccaa 
ctaaaaggaa taggtaggag acctcttcta 
tcattcaaca gctgatgccc tataacccct 
agtagcaaga tctttacata attcagagtg 
cccctccatt tatttgacta aagcatcrca 
agaaatacag tgctttatgg ctctaacatt 
aaggatggca gcctcagggc ttccttatgt 
catctttttc ccctatcctg ttcttcccct 
gctggttctt gggctaggta gtggggacca 
agtaaactac ggtaccagtg ttagtgggaa 
tcctactcct acctggtcaa cccgctgctt 
acctgataag ggagagggaa atacaaggag 
gcccmcaagc cataaaccaa taaamcaaga 
aaaaaaaaaa aa 



cagtctgaag tgctggcctc tgtcggaggg 660 
ctgtacacag gcagtctctg gtatacaccc 720 
attgtgcggg tggagatcaa tggacaggat 780 
gacaagagca ttgtggacag tggcaccacc 840 
gctgcagtca aatccatcaa ggcagcctcc 900 
ctaggagagc agctggtgtg ctggcaagca 960 
atctcactct acctaatggg tgaggttacc 102 0 
cagcaatacc tgcggccagt ggaagatgtg 1080 
gccatctcac agtcatccac gggcactgtt 1140 
gttgtctttg atcgggcccg aaaacgaatt 1200 
gatgagttca ggacggcagc ggtggaaggc 1260 
ggctacaaca ttccacagac agatgagtca 1320 
gccatctgcg ccctcttcat gctgccactc 1380 
cgctgcctgc gccagcagca tgatgacttt 1440 
ggcccatggg cagaagatag agattcccct 1500 
caagtaggag acacagatgg cacctgtggc 1560 
aaatgcctct gccttgatgg agaaggaaaa 1620 
ctgtaggaaa cagaaaagag aagaaagaag 1680 
ctcaaattta agtcgggaaa ttctgctgct 1740 
attcctttaa attctccaac ccaaagtatt 1800 
tcacacgcag gttaccttgg cgtgtgtccc 1860 
ttgtttccct gctggccaaa gtcagtagga 1920 
gacagggact gtataaacaa gcctaacatt 1980 
aaactagatt gactatttat acaaatgggg 204 0 
acaaagacag ggaatagtgg gatcaaagct 2100 
tcctagtttt agacctcatc tccaagatag 2160 
caatgttttc ttttctgtgg ttgcagcctg 2220 
gccaaagagc tcttttttag ctctcttaaa 2280 
acatgaattt ctgccatatt aatttcattg 2340 
atgataggca gcactgaaat atcctaaccc 2400 
actggactat agcagggctg ggctctgtct 2460 
atcttcctct ggagctttgc agccaaggtg 2520 
tctaatcctt aaaagcataa tgttgaacat 2580 
gcctggattt cttcctatta ggctataaga 2640 
gtttcattgc cttcctaccc tctctaatgg 2700 
cagtggcact agcattatac caagagtatg 2760 
actgccttca gtatcaaggc tgcctggaga 2 820 
cctccaccac aagagctcct tgatgaaggt 2880 
ccccgctcct aatggtacgt gggtacccag 2940 
agttcattac ctccctatca gttctagcat 3000 
gagctgggtt ttcctagtat acccactgca 3060 
ccaggtatgg gacctgctaa gtgtggaatt 3120 
ggcctctggt gttcctggcc tcagccagct 3180 
atactgagtc taaaaaaaaa aaaaaaaaaa 3240 

3252 



<210> 2 



2 



<211> 488 
<212> PRT 

<213> Homo sapiens 
<220> 

<223> Purified Memapisin 2 
<220> 

<223> Amino Acids 28-48 are remnant putative propeptide 
residues 

<220> 

<223> Amino Acids 58-61, 78, 80, 82-83, 116, 118-121, 
156, 166, 174, 246, 274, 276, 278-281, 283, and 
376-377 are refsidues in contact with the OM9 9-2 
inhibitor 

<220> 

<223> Amino acids 54-57, 61-68, 73-80, 86-89, 109-111, 
113-118, 123-134, 143-154, 165-168, 198-202, and 
220-224 are N-lobe Beta Strands 

<220> 

<223> Amino Acids 184-191 and 210-217 are N-lobe Helices 
<220> 

<223> Amino acids 237-240, 247-249, 251-256, 259-260, 
273-275, 282-235, 316-318, 331-336, 342-348, 
354-357, 366-370, 372-375, 380-383, 390-395, 
400-405, and 418-420 are C-lobe Beta Strands 

<220> 

<223> Amino Acids 28(5-299, 307-310, 350-353, 384-387, 
and 427-431 are C-lobe Helices 

<400> 2 

Ala Gly Val Leu Pro Ala His Gly Thr Gin His Gly lie Arg Leu Pro 
15 10 15 

Leu Arg Ser Gly Leu Gly Gly Ala Pro Leu Gly Leu Arg Leu Pro Arg 
20 25 30 

Glu Thr Asp Glu Glu 3?ro Glu Glu Pro Gly Arg Arg Gly Ser Phe Val 
35 40 45 

Glu Met Val Asp Asn Leu Arg Gly Lys Ser Gly Gin Gly Tyr Tyr Val 
50 55 60 



3 



Glu Met Thr Val Gly I3er Pro Pro Gin Thr Leu Asn lie Leu Val Asp 
65 70 75 80 

Thr Gly Ser Ser Asn Phe Ala Val Gly Ala Ala Pro His Pro Phe Leu 
85 90 95 

His Arg Tyr Tyr Gin Arg Gin Leu Ser Ser Thr Tyr Arg Asp Leu Arg 
100 105 110 

Lys Gly Val Tyr Val Pro Tyr Thr Gin Gly Lys Trp Glu Gly Glu Leu 
115 120 125 

Gly Thr Asp Leu Val Ser lie Pro His Gly Pro Asn Val Thr Val Arg 
130 135 140 

Ala Asn lie Ala Ala lie Thr Glu Ser Asp Lys Phe Phe lie Asn Gly 
145 150 155 160 

Ser Asn Trp Glu Gly lie Leu Gly Leu Ala Tyr Ala Glu lie Ala Arg 
165 170 175 

Pro Asp Asp Ser Leu Glu Pro Phe Phe Asp Ser Leu Val Lys Gin Thr 
180 185 190 

His Val Pro Asn Leu Fhe Ser Leu Gin Leu Cys Gly Ala Gly Phe Pro 
195 200 205 

Leu Asn Gin Ser Glu Val Leu Ala Ser Val Gly Gly Ser Met lie lie 
210 215 220 

Gly Gly lie Asp His Ser Leu Tyr Thr Gly Ser Leu Trp Tyr Thr Pro 
225 230 235 240 

lie Arg Arg Glu Trp Tyr Tyr Glu Val lie lie Val Arg Val Glu lie 
245 250 255 

Asn Gly Gin Asp Leu Lys Met Asp Cys Lys Glu Tyr Asn Tyr Asp Lys 
260 265 270 

Ser lie Val Asp Ser Gly Thr Thr Asn Leu Arg Leu Pro Lys Lys Val 
275 280 285 

Phe Glu Ala Ala Val Lys Ser lie Lys Ala Ala Ser Ser Thr Glu Lys 
290 295 300 

Phe Pro Asp Gly Phe Trp Leu Gly Glu Gin Leu Val Cys Trp Gin Ala 
305 310 315 320 
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Gly Thr Thr Pro 



Gly Glu Val Thr 
340 

Tyr Leu Arg Pro 
355 

Lys Phe Ala He 
370 

He Met Glu Gly 
385 

Gly Phe Ala Val 



Ala Val Glu Gly 
420 

Asn He Pro Gin 
435 

Met Ala Ala He 
450 

Cys Gin Trp Arg 
465 

Ala Asp Asp He 



Trp Asn He Phe 
325 

Asn Gin Ser Phe 



Val Glu Asp Val 
360 

Ser Gin Ser Ser 
375 

Phe 'Tyr Val Val 
3 90 

Ser Ala Cys His 
405 

Pro Phe Val Thr 



Thr Asp Glu Ser 
440 

Cys 2^1 a Leu Phe 
455 

Cys Leu Arg Cys 
470 

Ser Leu Leu Lys 
485 



Pro Val He Ser 
330 

Arg He Thr He 
345 

Ala Thr Ser Gin 



Thr Gly Thr Val 
380 

Phe Asp Arg Ala 
395 

Val His Asp Glu 
410 

Leu Asp Met Glu 
425 

Thr Leu Met Thr 



Met Leu Pro Leu 
460 

Leu Arg Gin Gin 
475 



Leu Tyr Leu Met 
335 

Leu Pro Gin Gin 
350 

Asp Asp Cys Tyr 
365 

Met Gly Ala Val 



Arg Lys Arg He 
400 

Phe Arg Thr Ala 
415 

Asp Cys Gly Tyr 
430 

He Ala Tyr Val 
445 

Cys Leu Met Val 



His Asp Asp Phe 
480 



<210> 3 
<211> 503 
<212> PRT 

<213> Homo sapiens 
<220> 

<223> Pro-memapsin 2 
<220> 

<223> Amino Acids 1-15 are vector-derived residues 
<220> 

<223> Amino Acids 16-64 are a putative pro peptide 
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<220> 

<223> Amino Acids 1-13 are the T7 promoter 
<220> 

<223> Amino Acids 16-456 are Pro-memapsin 2-Tl 
<220> 

<223> Amino Acids 16-421 are Promemapsin 2-T2 
<400> 3 

Met Ala Ser Met Thr Gly Gly Gin Gin Met Gly Arg Gly Ser Met Ala 
15 10 15 

Gly Val Leu Pro Ala His Gly Thr Gin His Gly lie Arg Leu Pro Leu 
20 25 30 

Arg Ser Gly Leu Gly Gly Ala Pro Leu Gly Leu Arg Leu Pro Arg Glu 
35 40 45 

Thr Asp Glu Glu Pro Glu Glu Pro Gly Arg Arg Gly Ser Phe Val Glu 
50 55 60 

Met Val Asp Asn Leu Arg Gly Lys Ser Gly Gin Gly Tyr Tyr Val Glu 
65 70 75 80 

Met Thr Val Gly Ser Pro Pro Gin Thr Leu Asn lie Leu Val Asp Thr 
85 90 95 

Gly Ser Ser Asn Phe Ala Val Gly Ala Ala Pro His Pro Phe Leu His 
100 105 110 

Arg Tyr Tyr Gin Arg Gin Leu Ser Ser Thr Tyr Arg Asp Leu Arg Lys 
115 120 125 

Gly Val Tyr Val Pro ^Cyr Thr Gin Gly Lys Trp Glu Gly Glu Leu Gly 
130 135 140 

Thr Asp Leu Val Ser ICle Pro His Gly Pro Asn Val Thr Val Arg Ala 
145 150 155 160 

Asn lie Ala Ala lie Thr Glu Ser Asp Lys Phe Phe lie Asn Gly Ser 
165 170 175 

Asn Trp Glu Gly lie Leu Gly Leu Ala Tyr Ala Glu lie Ala Arg Pro 
180 185 190 

Asp Asp Ser Leu Glu Pro Phe Phe Asp Ser Leu Val Lys Gin Thr His 
195 200 205 
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Val Pro Asn Leu Phe Ser Leu Gin Leu Cys Gly Ala Gly Phe Pro Leu 
210 215 220 



Asn Gin Ser Glu Val Leu Ala Ser Val Gly Gly Ser Met lie lie Gly 
225 230 235 240 

Gly He Asp His Ser Leu Tyr Thr Gly Ser Leu Trp Tyr Thr Pro lie 
245 250 255 

Arg Arg Glu Trp Tyr Tyr Glu Val He He Val Arg Val Glu He Asn 
260 265 270 

Gly Gin Asp Leu Lys Met Asp Cys Lys Glu Tyr Asn Tyr Asp Lys Ser 
275 280 285 

He Val Asp Ser Gly Thr Thr Asn Leu Arg Leu Pro Lys Lys Val Phe 
290 295 300 

Glu Ala Ala Val Lys Ser He Lys Ala Ala Ser Ser Thr Glu Lys Phe 
305 310 315 320 

Pro Asp Gly Phe Trp Leu Gly Glu Gin Leu Val Cys Trp Gin Ala Gly 
325 330 . 335 

Thr Thr Pro Trp Asn He Phe Pro Val He Ser Leu Tyr Leu Met Gly 
340 345 350 

Glu Val Thr Asn Gin Ser Phe Arg He Thr He Leu Pro Gin Gin Tyr 
355 360 365 

Leu Arg Pro Val Glu Asp Val Ala Thr Ser Gin Asp Asp Cys Tyr Lys 
370 375 380 

Phe Ala He Ser Gin Ser Ser Thr Gly Thr Val Met Gly Ala Val He 
385 390 395 400 

Met Glu Gly Phe Tyr Val Val Phe Asp Arg Ala Arg Lys Arg He Gly 
405 410 415 

Phe Ala Val Ser Ala Cys His Val His Asp Glu Phe Arg Thr Ala Ala 
420 425 430 

Val Glu Gly Pro Phe Val Thr Leu Asp Met Glu Asp Cys Gly Tyr Asn 
435 440 445 

He Pro Gin Thr Asp Glu Ser Thr Leu Met Thr He Ala Tyr Val Met 
450 455 460 
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Ala Ala He Cys Ala Leu Phe Met Leu Pro Leu Cys Leu Met Val Cys 
465 470 475 480 



Gin Trp Arg Cys Leu Arg Cys Leu Arg Gin Gin His Asp Asp Phe Ala 
485 490 495 

Asp Asp He Ser Leu Leu Lys 
500 



<210> 4 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 4 

Ser Glu Val Lys Met Asp Ala Glu Phe Arg 
15 10 



<210> 5 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
Peptide 

<400> 5 

Ser Glu Val Asn Leu Asp Ala Glu Phe Arg 
15 10 



<210> 6 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
Peptide 

<400> 6 
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Ser Val Asn Met Ala Glu Gly Asp 
1 5 



<210> 7 
<211> 12 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
Peptide 

<400> 7 

Lys Gly Gly Val Val He Ala Thr Val He Val Lys 
15 10 



<210> 8 
<211> 4 
<212> PRT 

<213> Homo sapiens 
<400> 8 

Asp Thr Ser Gly 
1 



<210> 9 
<211> 8 
<212> PRT 

<213> Homo sapiens 
<400> 9 

Leu Val Asn Met Ala Glu Gly Asp 
1 5 



<210> 10 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 10 

ggtaagcatc ccccatggcc ccaacgtc 
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<210> 11 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 11 

gacgttgggg ccatggggga tgcttacc 

<210> 12 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 12 

acgttgtctt tgatcgggcc cgaaaacgaa ttgg 

<210> 13 
<211> 33 
<212> DNA 

<213> Artificial Seqiaence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 13 

ccaattcgtt ttcgggcccg atcaaagaca acg 



<210> 14 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 14 

ccatcctaat acgactcact atagggc 
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<210> 15 
<211> 23 
<212> DNA 

<213> Artificial Seqxience 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 15 

actcactata gggctcgagc ggc 23 



<210> 16 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 16 

cttttgagca agttcagcct ggttaa 2 6 



<210> 17 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 17 

gaggtggctt atgagtattt cttccagggt a 31 



<210> 18 
<211> 22 
<212> DNA 

<213> Artificial Secpience 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 18 

tggcgacgac tcctggagcc eg 22 
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<210> 19 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 19 

tgacaccaga ccaactggta atgg 

<210> 20 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 20 

catatggcgg gagtgctgcc tgcccac 

<210> 21 
<211> 38 
<212> DNA 

<213> Artificial Seqi:ience 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 21 

ggatcctcac ttcagcaggg agatgtcatc agcaaagt 

<210> 22 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Oxidized 
Insulin B- chain 

<220> 
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<223> Xaa at site 3 represents cysteic acid 
<400> 22 

His Leu Xaa Gly Ser His Leu Val 
1 5 



<210> 23 
<211> 8 
<212> PRT 

<213> Artificial Secpience 
<220> 

<223> Description of Artificial Sequence: Oxidized 
Insulin B-chain 

<220> 

<223> Xaa at site 1 represents cysteic acid 
<400> 23 

Xaa Gly Glu Arg Gly Phe Phe Tyr 
1 5 



<210> 24 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
Peptide 

<400> 24 

Val Gly Ser Gly Val 

1 5 



<210> 25 
<211> 7 
<212> PRT 

<213> Artificial Sec[uence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
Peptide 

<400> 25 
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Val Gly Ser Gly Val Leu Leu 
1 5 



<210> 26 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
Peptide 

<400> 26 

Gly Val Leu Leu Ser Arg Lys 
1 5 



<210> 27 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Inhibitors 
<400> 27 

Val Asn Leu Ala Ala Glu Phe 
1 5 



<210> 28 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Inhibitors 
<400> 28 

Glu Val Asn Leu Ala Ala Glu Phe 
1 5 



<210> 29 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: Synthetic 
Peptide 

<400> 29 
Asn Leu Ala Ala 
1 



<210> 30 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
Peptide 

<400> 30 

Val Gly Ser Gly Val Leu Leu Ser Arg Lys 
15 10 



<210> 31 

<211> 326 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> Amino acids 2-!5, 6-9, 13-20, 25-32, 65-67, 69-74, 
79-87, 89-91, 99-106, 119-122, 150-154, 164-167, 
180-183, 191-194, 196-199, 201-204, 210-214, 
221-223, 258-2(52, 265-269, and 275-278 are Beta 
Strands 

<220> 

<223> Amino acids 281-284, 286-288, 298-301, 310-315, 
and 319-324 are Beta strands 

<220> 

<223> Amino acids 48-51, 111-114, 136-142, 225-234, 
249-254, 271-274, and 303-306 are Helices 

<220> 

<223> Amino acids 12-13, 30, 32, 34-35, 73-77, 111, 117, 
120, 189, 213, 215, 217-220, 287, 289, 291, 298, 
and 3 00 are residues in contact with pepstatin. 
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<220> 

<223> Pepsin 



<400> 31 

Val Asp Glu Gin Pro ]jeu Glu Asn Tyr Leu Asp Met Glu Tyr Phe Gly 
15 10 15 

Thr He Gly He Gly Thr Pro Ala Gin Asp Phe Thr Val Val Phe Asp 
20 25 30 

Thr Gly Ser Ser Asn 3jeu Trp Val Pro Ser Val Tyr Cys Ser Ser Leu 
35 40 45 

Ala Cys Thr Asn His Asn Arg Phe Asn Pro Glu Asp Ser Ser Thr Tyr 
50 55 60 

Gin Ser Thr Ser Glu Thr Val Ser He Thr Tyr Gly Thr Gly Ser Met 
65 70 75 80 

Thr Gly He Leu Gly ^ryr Asp Thr Val Gin Val Gly Gly He Ser Asp 
85 90 95 

Thr Asn Gin He Phe Gly Leu Ser Glu Thr Glu Pro Gly Ser Phe Leu 
100 105 110 

Tyr Tyr Ala Pro Phe Asp Gly He Leu Gly Leu Ala Tyr Pro Ser He 
115 120 125 

Ser Ser Ser Gly Ala Thr Pro Val Phe Asp Asn He Trp Asn Gin Gly 
130 135 140 

Leu Val Ser Gin Asp Leu Phe Ser Val Tyr Leu Ser Ala Asp Asp Gin 
145 150 155 160 

Ser Gly Ser Val Val He Phe Gly Gly He Asp Ser Ser Tyr Tyr Thr 
165 170 175 

Gly Ser Leu Asn Trp Val Pro Val Thr Val Glu Gly Tyr Trp Gin He 
180 185 190 

Thr Val Asp Ser He Thr Met Asn Gly Glu Ala He Ala Cys Ala Glu 
195 200 205 

Gly Cys Gin Ala He Val Asp Thr Gly Thr Ser Leu Leu Thr Gly Pro 
210 215 220 

Thr Ser Pro He Ala Asn He Gin Ser Asp He Gly Ala Ser Glu Asn 
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225 



230 



235 



240 



Ser Asp Gly Asp 



Asp lie Val Phe 
260 

Ala Tyr lie Leu 
275 

Met Asn Leu Pro 
290 

Phe lie Arg Gin 
305 



Met Val Val Ser 
245 

Thr lie Asn Gly 



Gin Sler Glu Gly 
280 

Thr Glu Ser Gly 
295 

Tyr Phe Thr Val 

310 



Cys Ser Ala lie 
250 

Val Gin Tyr Pro 
265 

Ser Cys lie Ser 



Glu Leu Trp lie 
300 

Phe Asp Arg Ala 
315 



Ser Ser Leu Pro 
255 

Val Pro Pro Ser 
270 

Gly Phe Gin Gly 
285 

Leu Gly Asp Val 



Asn Asn Gin Val 
320 



Gly Leu Ala Pro Val Ala 
325 
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